Vascular endothelial growth factor-X

ABSTRACT

There is provided a novel vascular endothelial growth factor, herein designated VEGF-X, in addition to the nucleic acid molecule encoding it, a host cell transformed with said vector and compounds which inhibit or enhance angiogenesis. Also provided is the sequence of a CUB domain present in the sequence of VEGF-X which domain itself prevents angiogenesis and which is used to treat diseases associated with inappropriate vascularisation or angiogenesis.

This application claims the benefit of provisional applications No. 60/124,967, filed Mar. 18, 1999 and Ser. No. 60/164,131 filed Nov. 8, 1999.

The present invention is concerned with a novel vascular endothelial growth factor (VEGF) herein designated “VEGF-X”, and characterisation of the nucleic acid and amino acid sequences of VEGF-X.

INTRODUCTION

Angiogenesis involves formation and proliferation of new blood vessels, and is an essential physiological process for normal growth and development of tissues in, for example, embryonic development, tissue regeneration and organ and tissue repair. Angiogenesis also features in the growth of human cancers which require continuous stimulation of blood vessel growth. Abnormal angiogenesis is associated with other diseases such as rheumatoid arthritis psoriasis and diabetic retinopathy.

Capillary vessels consist of endothelial cells which carry the genetic information necessary to proliferate to for capillary networks. Angiogenic molecules which can initiate this process have previously been characterised. A highly selective mitogen for vascular enothelial cells is vascular endothelial growth factor (VEGF) (Ferrara et al., “Vascular Endothelial Growth Factor: Basic Biology and Clinical Implications”. Regulation of angiogenesis, by I. D. Goldberg and E. M. Rosen 1997 Birkhauser Verlag Basle/Switzerland). VEGF is a potent vasoactive protein which is comprised of a glycosylated cationic 46-49 kd dimer having two 24 kd subunits. It is inactivated by sulfhydryl reducing agents and is resistant to acidic pH and to heating and binds to immobilised heparin. VEGF-A has four different forms of 121, 165, 189 and 206 amino acids respectively due to alternative splicing. VEGF121 and VEGF165 are soluble and are capable of promoting angiogenesis, whereas VEGF189 and VEGF206 are bound to heparin containing proteoglycans in the cell surface. The temporal and spatial expression of VEGF has been correlated with physiological proliferation of the blood vessels (Gajdusek, C. M., and Carbon, S. J., Cell Physiol., 139:570-579, (1989)); McNeil, P. L., Muthukrishnan, L., Warder, E., D'Amore, P. A., J. Cell. Biol., 109:811-822, (1989)). Its high affinity binding sites are localized only on endothelial cells in tissue sections (Jakeman, L. B., et al., Clin. Invest. 89:244-253 (1989)). The growth factor can be isolated from pituitary cells and several tumor cell lines, and has been implicated in some human gliomas (Plate, K. H. Nature 359:845-848, (1992)). The inhibition of VEGF function by anti-VEGF monoclonal antibodies was shown to inhibit tumor growth in immune-deficient mice (Kim, K. J., Nature 362:841-844, (1993)).

VEGF proteins have been described in the following patents and applications all of which are hereby incorporated by reference EP-0,506,477, WO-95/24473, WO-98/28621, WO-90/13649, EP-0,476,983, EP-0,550,296, WO-90/13649, WO-96/26736, WO-96/27007, WO-98/49300, WO-98/36075, WO-90/11084, WO-98/24811, WO-98/10071, WO-98/07832, WO-98/02543, WO-97/05250, WO-91/02058, WO-96/39421, WO-96/39515, WO-98/16551.

The present inventors have now identified a further vascular endothelial growth factor, designated herein as “VEGF-X”, and the nucleic acid sequence encoding it, which has potentially significant benefits for the treatment of tumours and other conditions mediated by inappropriate angiogenic activity.

SUMMARY OF THE INVENTION

In the present application, there is provided a novel vascular endothelial growth factor, herein designated “VEGF-X”, nucleic acid molecules encoding said growth factor, an expression vector comprising said nucleic acid molecule, a host cell transformed with said vector and compounds which inhibit or enhance angiogenesis. Also provided is the sequence of a CUB domain present in the sequence of VEGF-X which domain itself prevents angiogenesis and which is used to treat diseases associated with inappropriate vascularisation or angiogenesis.

DETAILED DESCRIPTION OF THE INVENTION

Therefore, according to a first aspect of the present invention there is provided a nucleic acid molecule encoding a VEGF-X protein or a functional equivalent, fragment, derivative or bioprecursor thereof, said protein comprising the amino acid sequence from position 23 to 345 of the amino acid sequence illustrated in FIG. 10. Alternatively, the nucleic acid molecule of the invention encodes the complete sequence identified in FIG. 10 and which advantageously includes a signal peptide to express said protein extracellularly. Preferably, the nucleic acid molecule is a DNA and even more preferably a cDNA molecule. Preferably, the nucleic acid molecule comprises the nucleotide sequence from position 257 to 1291 of the nucleotide sequence illustrated in FIG. 9. In a preferred embodiment the nucleic acid is of mammalian origin and even more preferably of human origin.

In accordance with the present invention a functional equivalent should be taken to mean a protein, or a sequence of amino acids that have similar function to the VEGF-X protein of the invention.

Also provided by this aspect of the present invention is a nucleic acid molecule such as an antisense molecule capable of hybridising to the nucleic acid molecules according to the invention under high stringency conditions, which conditions would be well known to those skilled in the art.

Stringency of hybridisation as used herein refers to conditions under which polynucleic acids are stable. The stability of hybrids is reflected in the melting temperature (Tm) of the hybrids. Tm can be approximated by the formula:

81.5° C.+16.6(log₁₀[Na⁺]+0.41 (%G&C)−600/1

wherein 1 is the length of the hybrids in nucleotides. Tm decreases approximately by 1-1.5° C. with every 1% decrease in sequence homology.

The term “stringency” refers to the hybridisation conditions wherein a single-stranded nucleic acid joins with a complementary strand when the purine or pyrimidine bases therein pair with their corresponding base by hydrogen bonding. High stringency conditions favour homologous base pairing whereas low stringency conditions favour non-homologous base pairing.

“Low stringency” conditions comprise, for example, a temperature of about 37° C. or less, a formamide concentration of less than about 50%, and a moderate to low salt (SSC) concentration; or, alternatively, a temperature of about 50° C. or less, and a moderate to high salt (SSPE) concentration, for example 1M NaCl.

“High stringency” conditions comprise, for example, a temperature of about 42° C. or less, a formamide concentration of less than about 20%, and a low salt (SSC) concentration; or, alternatively, a temperature of about 65° C., or less, and a low salt (SSPE) concentration. For example, high stringency conditions comprise hybridization in 0.5 M NaHPO₄, 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 65° C. (Ausubel, F. M. et al. Current Protocols in Molecular Biology, Vol. I, 1989; Green Inc. New York, at 2.10.3).

“SSC” comprises a hybridization and wash solution. A stock 20×SSC solution contains 3M sodium chloride, 0.3M sodium citrate, pH 7.0.

“SSPE” comprises a hybridization and wash solution. A 1×SSPE solution contains 180 mM NaCl, 9 mM Na₂HPO₄ and 1 mM EDTA, pH 7.4.

The nucleic acid capable of hybridising to nucleic acid molecules according to the invention will generally be at least 70%, preferably at least 80 or 90% and more preferably at least 95% homologous to the nucleotide sequences according to the invention.

The antisense molecule capable of hybridising to the nucleic acid according to the invention may be used as a probe or as a medicament or may be included in a pharmaceutical composition with a pharmaceutically acceptable carrier, diluent or excipient therefor.

The term “homologous” describes the relationship between different nucleic acid molecules or amino acid sequences wherein said sequences or molecules are related by partial identity or similarity at one or more blocks or regions within said molecules or sequences.

The present invention also comprises within its scope proteins or polypeptides encoded by the nucleic acid molecules according to the invention or a functional equivalent, derivative or bioprecursor thereof.

Therefore, according to a further aspect of the present invention, there is provided a VEGF-X protein, or a functional equivalent, derivative or bioprecursor thereof, comprising an amino acid sequence from position 23 to 345 of the sequence as illustrated in FIG. 10, or alternatively which amino acid sequence comprises the complete sequence of FIG. 10. A further aspect of the invention comprises a VEGF-X protein, or a functional equivalent, derivative or bioprecusor thereof, encoded by a nucleic acid molecule according to the invention. Preferably, the VEGF-X protein encoded by said nucleic acid molecule. comprises the sequence from position 23 to 345 of the amino acid sequence as illustrated in FIG. 10, or which sequence alternatively comprises the sequence of amino acids of FIG. 10.

The DNA molecules according to the invention may, advantageously, be included in a suitable expression vector to express VEGF-X encoded therefrom in a suitable host. Incorporation of cloned DNA into a suitable expression vector for subsequent transformation of said cell and subsequent selection of the transformed cells is well known to those skilled in the art as provided in Sambrook et al. (1989), molecular cloning, a laboratory manual, Cold Spring Harbour Laboratory Press.

An expression vector according to the invention includes a vector having a nucleic acid according to the invention operably linked to regulatory sequences, such as promoter regions, that are capable of effecting expression of said DNA fragments. The term “operably linked” refers to a juxta position wherein the components described are in a relationship permitting them to function in their intended manner. Such vectors may be transformed into a suitable host cell to provide for expression of a polypeptide according to the invention. Thus, in a further aspect, the invention provides a process for preparing polypeptides according to the invention which comprises cultivating a host cell, transformed or transfected with an expression vector as described above under conditions to provide for expression by the vector of a coding sequence encoding the polypeptides, and recovering the expressed polypeptides.

The vectors may be, for example, plasmid, virus or phage vectors provided with an origin of replication, and optionally a promoter for the expression of said nucleotide and optionally a regulator of the promoter.

The vectors may contain one or more selectable markers, such as, for example, ampicillin resistance.

Regulatory elements required for expression include promoter sequences to bind RNA polymerase and transcription initiation sequences for ribosome binding. For example, a bacterial expression vector may include a promoter such as the lac promoter and for translation initiation the Shine-Dalgarno sequence and the start codon AUG. Similarly, a eukaryotic expression vector may include a heterologous or homologous promoter for RNA polymerase II, a downstream polyadenylation signal, the start codon AUG, and a termination codon for detachment of the ribosome. Such vectors may be obtained commercially or assembled from the sequences described by methods well known in the art.

Nucleic acid molecules according to the invention may be inserted into the vectors described in an antisense orientation in order to provide for the production of antisense RNA. Antisense RNA or other antisense nucleic acids may be produced by synthetic means.

In accordance with the present invention, a defined nucleic acid includes not only the identical nucleic acid but also any minor base variations including in particular, substitutions in cases which result in a synonymous codon (a different codon specifying the same amino acid residue) due to the degenerate code in conservative amino acid substitutions. The term “nucleic acid sequence” also includes the complementary sequence to any single stranded sequence given regarding base variations.

The present invention also advantageously provides nucleic acid sequences of at least approximately 10 contiguous nucleotides of a nucleic acid according to the invention and preferably from 10 to 50 nucleotides even more preferably, the nucleic acid sequence comprise the sequences illustrated in FIG. 3. These sequences may, advantageously be used as probes or primers to initiate replication, or the like. Such nucleic acid sequences may be produced according to techniques well known in the art, such as by recombinant or synthetic means. They may also be used in diagnostic kits or the like for detecting the presence of a nucleic acid according to the invention. These tests generally comprise contacting the probe with the sample under hybridising conditions and detecting for the presence of any duplex or triplex formation between the probe and any nucleic acid in the sample.

The nucleic acid sequences according to this aspect of the present invention comprise the sequences of nucleotides illustrated in FIGS. 3 and 5.

According to the present invention these probes may be anchored to a solid support. Preferably, they are present on an array so that multiple probes can simultaneously hybridize to a single biological sample. The probes can be spotted onto the array or synthesised In situ on the array. (See Lockhart et al., Nature Biotechnology, vol. 14, December 1996 “Expression monitoring by hybridisation to high density oligonucleotide arrays”. A single array can contain more than 100, 500 or even 1,000 different probes in discrete locations.

The nucleic acid sequences, according to the invention may be produced using such recombinant or synthetic means, such as for example using PCR cloning mechanisms which generally involve making a pair of primers, which may be from approximately 10 to 50 nucleotides to a region of the gene which is desired to be cloned, bringing the primers into contact with mRNA, cDNA, or genomic DNA from a human cell, performing a polymerase chain reaction under conditions which brings about amplification of the desired region, isolating the amplified region or fragment and recovering the amplified DNA. Generally, such techniques are well known in the art, such as described in Sambrook et al. (Molecular Cloning: a Laboratory Manual, 1989).

The nucleic acids or oligonucleotides according to the invention may carry a revealing label. Suitable labels include radioisotopes such as ³²P or ³⁵S, enzyme labels or other protein labels such as biotin or fluorescent markers. Such labels may be added to the nucleic acids or oligonucleotides of the invention and may be detected using known techniques per se.

Advantageously, human allelic variants or polymorphisms of the DNA molecule according to the invention may be identified by, for example, probing cDNA or genomic libraries from a range of individuals, for example, from different populations. Furthermore, nucleic acids and probes according to the invention may be used to sequence genomic DNA from patients using techniques well known in the art, such as the Sanger Dideoxy chain termination method, which may, advantageously, ascertain any predisposition of a patient to certain disorders associated with a growth factor according to the invention.

The protein according to the invention includes all possible amino acid variants encoded by the nucleic acid molecule according to the invention including a polypeptide encoded by said molecule and having conservative amino acid changes. Conservative amino acid substitution refers to a replacement of one or more amino acids in a protein as identified in Table 1. Proteins or polypeptides according to the invention further include variants of such sequences, including naturally occurring allelic variants which are substantially homologous to said proteins or polypeptides. In this context, substantial homology is regarded as a sequence which has at least 70%, preferably 80 or 90% and preferably 95% amino acid homology with the proteins or polypeptides encoded by the nucleic acid molecules according to the invention. The protein according to the invention may be recombinant, synthetic or naturally occurring, but is preferably recombinant.

The nucleic acid or protein according to the invention may be used as a medicament or in the preparation of a medicament for treating cancer or other diseases or conditions associated with expression of VEGF-X protein.

Advantageously, the nucleic acid molecule or the protein according to the invention may be provided in a pharmaceutical composition together with a pharmacologically acceptable carrier, diluent or excipient therefor.

The present invention is further directed to inhibiting VEGF-X in vivo by the use of antisense technology. Antisense technology can be used to control gene expression through triple-helix formation of antisense DNA or RNA, both of which methods are based on binding of a polynucleotide to DNA or RNA. For example, the 5′ coding portion or the mature DNA sequence, which encodes for the protein of the present invention, is used to design an antisense RNA oligonucleotide of from 10 to 50 base pairs in length. A DNA oligonucleotide is designed to be complementary to a region of the gene involved in transcription (triple-helix—see Lee et al. Nucl. Acids Res., 6:3073 (1979); Cooney et al., Science, 241:456 (1988); and Dervan et al., Science, 251: 1360 (1991), thereby preventing transcription and the production of VEGF-X. The antisense RNA oligonucleotide hybridises to the mRNA in vivo and blocks translation of an mRNA molecule into the VEGF-X protein (antisense—Okano, J. Neurochem., 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, Fla. (1988)).

Alternatively, the oligonucleotide described above can be delivered to cells by procedures in the art such that the anti-sense RNA and DNA may be expressed in vivo to inhibit production of VEGF-X in the manner described above.

Antisense constructs to VEGF-X, therefore, may inhibit the angiogenic activity of VEGF-X and prevent the further growth of or even regress solid tumours, since angiogenesis and neovascularization are essential steps in solid tumour growth. These antisense constructs may also be used to treat rheumatoid arthritis, psoriasis and diabetic retinopathy which are all characterized by abnormal angiogenesis.

A further aspect of the invention provides a host cell or organism, transformed or transfected with an expression vector according to the invention. The host cell or organism may advantageously be used in a method of producing VEGF-X, which comprises recovering any expressed VEGF-X from the host or organism transformed or transfected with the expression vector.

According to a further aspect of the invention there is also provided a transgenic cell, tissue or organism comprising a transgene capable of expressing VEGF-X protein according to the invention. The term “transgene capable of expression” as used herein means a suitable nucleic acid sequence which leads to expression of VEGF-X or proteins having the same function and/or activity. The transgene, may include, for example, genomic nucleic acid isolated from human cells or synthetic nucleic acid, including DNA integrated into the genome or in an extrachromosomal state. Preferably, the transgene comprises the nucleic acid sequence encoding the proteins according to the invention as described herein, or a functional fragment of said nucleic acid. A functional fragment of said nucleic acid should be taken to mean a fragment of the gene comprising said nucleic acid coding for the proteins according to the invention or a functional equivalent, derivative or a non-functional derivative such as a dominant negative mutant, or bioprecursor of said proteins. For example, it would be readily apparent to persons skilled in the art that nucleotide substitutions or deletions may be used using routine techniques, which do not affect the protein sequence encoded by said nucleic acid, or which encode a functional protein according to the invention.

VEGF-X protein expressed by said transgenic cell, tissue or organism or a functional equivalent or bioprecursor of said protein also forms part of the present invention.

Antibodies to the protein or polypeptide of the present invention may, advantageously, be prepared by techniques which are known in the art. For example, polyclonal antibodies may be prepared by inoculating a host animal, such as a mouse or rabbit, with the polypeptide according to the invention or an epitope thereof and recovering immune serum. Monoclonal antibodies may be prepared according to known techniques such as described by Kohler R. and Milstein C., Nature (1975) 256, 495-497. Advantageously, such antibodies may be included in a kit for identifying VEGF-X in a sample, together with means for contacting the antibody with the sample.

Advantageously, the antibody according to the invention may also be used as a medicament or in the preparation of a medicament for treating tumours or other diseases associated with expression of VEGF-X.

The invention also further provides a pharmaceutical composition comprising said antibody together with a pharmaceutically acceptable carrier diluent or excipient therefor.

Proteins which interact with the polypeptide of the invention may be identified by investigating protein-interactions using the two-hybrid vector system first proposed by Chien et al., (1991) Proc. Natl. Acad. Sci. USA 88:9578-9582.

This technique is based on functional reconstitution in vivo of a transcription factor which activates a reporter gene. More particularly the technique comprises providing an appropriate host cell with a DNA construct comprising a reporter gene under the control of a promoter regulated by a transcription factor having a DNA binding domain and an activating domain, expressing in the host cell a first hybrid DNA sequence encoding a first fusion of a fragment or all of a nucleic acid sequence according to the invention and either said DNA binding domain or said activating domain of the transcription factor, expressing in the host at least one second hybrid DNA sequence, such as a library or the like, encoding putative binding proteins to be investigated together with the DNA binding or activating domain of the transcription factor which is not incorporated in the first fusion; detecting any binding of the proteins to be investigated with a protein according to the invention by detecting for the presence of any reporter gene product in the host cell; optionally isolating second hybrid DNA sequences encoding the binding protein.

An example of such a technique utilises the GAL4 protein in yeast. GAL4 is a transcriptional activator of galactose metabolism in yeast and has a separate domain for binding to activators upstream of the galactose metabolising genes as well as a protein binding domain. Nucleotide vectors may be constructed, one of which comprises the nucleotide residues encoding the DNA binding domain of GAL4. These binding domain residues may be fused to a known protein encoding sequence, such as for example, the nucleic acids according to the invention. The other vector comprises the residues encoding the protein binding domain of GAL4. These residues are fused to residues encoding a test protein. Any interaction between polypeptides encoded by the nucleic acid according to the invention and the protein to be tested leads to transcriptional activation of a reporter molecule in a GAL-4 transcription deficient yeast cell into which the vectors have been transformed. Preferably, a reporter molecule such as β-galactosidase is activated upon restoration of transcription of the yeast galactose metabolism genes.

A further aspect of the present invention also provides a method of identifying VEGF-X in a sample, which method comprises contacting said sample with an antibody according to the invention and monitoring for any binding of any proteins to said antibody. A kit for identifying the presence of VEGF-X in a sample is also provided comprising an antibody according to the invention and means for contacting said antibody with said sample.

VEGF-X may be recovered and purified from recombinant cell cultures by methods known in the art, including ammonium sulfate or ethanol precipitation, acid extraction, anion or cation exchange chromatography, phosphocellulose chromatography, hydrophobic interaction chromatography, affinity chromatography, hydroxyapatite chromatography and lectin chromatography.

The VEGF-X protein of the present invention may be a naturally purified product, or a product of chemical synthetic procedures, or produced by recombinant techniques from a prokaryotic or eukaryotic host (for example, by bacterial yeast, higher plant, insect and mammalian cells in culture). Depending upon the host employed in a recombinant production procedure, the polypeptides of the present invention may be glycosylated with mammalian or other eukaryotic carbohydrates or may be non-glycosylated.

VEGF-X is particularly advantageous as a wound healing agent, where, for example, it is necessary to re-vascularize damaged tissues, or where new capillary angiogenesis is important. Accordingly, VEGF-X may be used for treatment of various types of wounds such as for example, dermal ulcers, including pressure sores, venous ulcers, and diabetic ulcers. In addition, it can be used in the treatment of full-thickness burns and injuries where angiogenesis is desired to prepare the burn in injured sites for a skin graft and flap. In this case, VEGF-X or the nucleic acid encoding it may be applied directly to the wound. VEGF-X may be used in plastic surgery when reconstruction is required following a burn, other trauma, or even for cosmetic purposes.

An important application of VEGF-X is to induce the growth of damaged bone, periodontium or ligament tissue. For example, it may be used in periodontal disease where VEGF-X is applied to the roots of the diseased teeth, leading to the formation of new bore and cementum with collagen fibre ingrowths. It can be used for regenerating supporting tissues of teeth, including alveolar bone, cementum and periodontal ligament, that have been damaged by disease and trauma.

Since angiogenesis is important in keeping wounds clean and non-infected, VEGF-X may be used in association with surgery and following the repair of cuts. It should be particularly useful in the treatment of abdominal wounds where there is a high risk of infection.

VEGF-X can also be used for the promotion of endothelialization in vascular graft surgery. In the case of vascular grafts using either transplanted or synthetic material, VEGF-X may be applied to the surface of the graft or at the junction to promote the growth of the vascular endothelial cells. One derivation of this is that VEGF-X can be used to repair the damage of myocardial and other occasions where coronary bypass surgery is needed by stimulating the growth of the transplanted tissue. Related to this is the use of VEGF-X to repair the cardiac vascular system after ischemia.

The protein of the present invention may also be employed in accordance with the present invention by expression of such protein in vivo, which is often referred to as “gene therapy”.

Thus, for example, cells such as bone marrow cells may be engineered with a polynucleotide (DNA or RNA) encoding for the protein ex vivo as defined herein, the engineered cells are then provided to a patient to be treated with the polypeptide. Such methods are well-known in the art. For example, calls may be engineered by procedures known in the art by use of a retroviral particle containing RNA encoding for the protein of the present invention.

Similarly, cells may be engineered in vivo for expression of the protein in vivo, for example, by procedures known in the art.

A further aspect of the invention comprises a method of treating a disorder mediated by expression of a protein according to the invention, by administering to a patient an amount of an antisense molecule as described herein, in sufficient concentration to alleviate or reduce the symptoms of said disorder.

Compounds which inhibit or enhance angiogenesis may be identified by providing a host cell or organism according to the invention or a transgenic cell, tissue or organism according to the invention, contacting a test compound with said cell, tissue or organism and monitoring for the effect of said compound compared to a cell tissue or organism which has not been contacted with said compound. These compounds may themselves be used as a medicament or included in a pharmaceutical composition for treatment of disorders mediated by inappropriate vascularisation or angiogenic activity.

The present inventors have also, advantageously, identified in the sequence encoding the VEGF-X protein a CUB domain, which has heretofore not previously been identified in VEGF-type growth factors. The VEGF-X protein may therefore exert dual regulatory effects via interaction with the VEGF tyrosine kinase receptors or with neuropilin receptors mediated by the CUB domain. Thus, the sequence encoding said CUB domain may be included in an expression vector for subsequent transformation of a host cell, tissue or organism.

VEGF-X or fragments thereof may be able to modulate the effects of pro-angiogenic growth factors such as VEGF as indicated in the findings presented in the examples below that the N-terminal part of the VEGF-X protein, a CUB-like domain, is able to inhibit VEGF-stimulated proliferation of HUVECs. VEGF-X or fragments thereof may therefore be useful in therapy of conditions involving inappropriate angiogenesis. Inhibition of the angiogenic activity of VEGF has been linked with inhibition of tumour growth in several models eg Kim K. J. et al, Nature 362:841-844, (1993). Additionally, agents able to inhibit angiogenesis would be expected to be useful in treating other angiogenesis-dependent diseases such a retinopathy, osteoarthritis and psoriasis (Folkman, J., Nature Medicine 1:27-31, (1995).

As identified in more detail in the Examples described herein the present inventors have surprisingly identified that the CUB domain of VEGF-X is able to inhibit stimulation of proliferation of HUVECs induced by either VEGF or bFGF. The CUB domain may, therefore, be utilised as a therapeutic agent for inhibition of angiogenesis and for treatment of condition associated with inappropriate vascularisation or angiogenesis.

Therefore according to a further aspect of the invention there is provided a method of inhibiting angiogenic activity and inappropriate vascularisation including formation and proliferation of new blood vessels, growth and development of tissues, tissue regeneration and organ and tissue repair in a subject said method comprising administering to said subject an amount of a polypeptide having an amino acid sequence from position 40 to 150 of the sequence illustrated in FIG. 10 or a nucleic acid molecule encoding the CUB domain according to the invention in sufficient concentration to reduce or prevent said angiogenic activity.

Furthermore there is also provided a method of treating or preventing any of cancer, rheumatoid arthritis, psoriasis and diabetic retinopathy, said method comprising administering to said subject an amount of a polypeptide having an amino acid sequence from position 40 to 150 of the sequence illustrated in FIG. 10 or a nucleic acid molecule encoding the CUB domain according to the invention in sufficient concentration to treat or prevent said disorders.

The CUB domain may also be used to identify compounds that inhibit or enhance angiogenic activity such as inappropriate vascularisation, in a method comprising contacting a cell expressing a VEGF receptor and/or a neuropilin 1 or 2 type receptor with said compound in the presence of a VEGF-X protein according to the invention and monitoring for the effect of said compound or said cell when compared to a cell which has not been contacted with said compound. Such compounds may then be used as appropriate to prevent or inhibit angiogenic activity to treat the disorders or conditions described herein, or in a pharmaceutical composition. An antibody to said CUB domain may also be useful in identifying other proteins having said sequences.

Deposited Plasmids Date of Deposit Accession No. Plasmid VEGFX/pCR2.1 Mar. 1, 1999 LMBP 3925 1TOPO FL Plasmid VEGFX/pRSETB BD Mar. 1, 1999 LMBP 3926 amino acids G230-G245 Plasmid VEGFX/pcR.2.1 Oct. 20, 1999 LMBP 3977 FL Clone 9 Plasmid VEGF-X CUB Dec. 20, 1999 LMBP 3991 PET22b

The above plasmids were deposited at the Belgian Coordinated Collections of Microorganisms (BCCM) at Laboratorium Voor Moleculaire Biologie-Plasmidencollectie (LMBP) B-9000, Ghent, Belgium, in accordance with the provisions of the Budapest Treaty of 28 Apr. 1977.

The invention may be more clearly understood with reference to the accompanying example, which is purely exemplary, with reference to the accompanying drawings, wherein:

FIG. 1: is a DNA sequence (SEQ ID NO:98) identified in the Incyte LifeSeq™ database coding for a novel VEGF-X protein.

FIG. 2: is an illustration of amino acid sequence (SEQ ID NO:99) of the nucleic acid sequence of FIG. 1.

FIG. 3: is an illustration of PCT primer sequences (SEQ ID NOs:4-13) utilised to identify the VEGF-X protein according to the invention.

FIG. 4: is a diagrammatic illustration of the spatial relationships in the VEGF-X sequence of the clones identified using the PCR primer sequences of FIG. 3.

FIG. 5: is an illustration of the nucleotide sequences of the 5′ RACE primers (SEQ ID NOs:14-17) used to identify the 5′ end of the VEGF-X open reading frame.

FIG. 6: is an illustration of the sequence (SEQ ID NO:100 and SEQ ID NO:101) obtained from the RACE experiment.

FIG. 7: is an illustration of the nucleotide sequences (SEQ ID NO:102 AND SEQ ID NO:103) obtained from the search of LifeSeq™ database using the sequence in FIG. 6.

FIG. 8: is an illustration of the primers used to clone the entire coding sequence of VEGF-X.

FIG. 9: is an illustration of the entire coding sequence (SEQ ID NO:104) of VEGF-X.

FIG. 10: is an illustration of the predicted amino acid sequence (SEQ ID NO:2) of the nucleotide sequence of FIG. 9. SEQ ID NO:1is amino acids 23-345 of SEQ ID NO:2.

FIG. 11: is an alignment of the sequence of FIG. 10 with the sequences of VEGF-A to D

FIG. 12: is an illustration of variant sequences (SEQ ID NOs:110-112) of the VEGF-X protein according to the invention.

FIG. 13: is an illustration of the oligonucleotide primers used for E.coli expression of VEGF-X domains and for expression of the full length sequence of VEGF-X in a baculovirus/insect cell expression system.

FIG. 14: depicts nucleic acid sequences (SEQ ID NOs:30-47) of 18 human EST clones obtained from a BLAST search of the LifeSeq™ database used to identify the full sequence encoding VEGF-X.

FIG. 15: depicts the nucleotide sequences (SEQ ID NOs:48-97) of 50 human EST clones obtained from the LifeSeq™ database.

FIG. 16: is an illustration of nucleotide sequences utilised as primers to identify the nucleotide sequence encoding VEGF-X.

FIG. 17: is a nucleotide sequence (SEQ ID NO:113 and SEQ ID NO:114) coding for a partial VEGF-X protein according to the invention.

FIG. 18: is an illustration of a partial nucleotide sequence encoding (SEQ ID NO:115 and SEQ ID NO:116) VEGF-X protein according to the invention.

FIG. 19: is an illustration of a DNA (SEQ ID NO:117) and polypeptide sequence (SEQ ID NO:118) used for mammalian cell expression of VEGF-X. The predicted VEGF-X signal sequence is in lower case letters. The C-terminal V5 epitope and His6 sequences are underlined.

FIG. 20: is an illustration of a DNA (SEQ ID NO:119) and polypeptide sequence (SEQ ID NO:120) used for baculovirus/insect cell expression of VEGF-X. In the polypeptide sequence the signal sequence is shown in lower case. The N-terminal peptide tag added to the predicted mature VEGF-X sequence is underlined.

FIG. 21: is an illustration of a DNA (SEQ ID NO:121) and polypeptide sequence (SEQ ID NO:122) used for E. coli expression of VEGF-X. The polypeptide sequences at the N- and C-termini derived from the MBP fusion and His6 tag respectively are underlined.

FIG. 22: illustrates the disulphide-linked dimerisation of VEGF-X. Protein samples were analysed by SDS-PAGE. Prior to loading the gel, samples were heated to 95° C. for 5 minutes in sample buffer in the presence (+) or absence (−) of reducing agent. (A) samples from COS cell expression of a C-terminally V5/His6 peptide-tagged construct. The left hand panel is total conditioned medium, the right hand panel is material purified on Nickel agarose resin. Reduced monomer and putative disulphide-linked, non-reduced dimer are indicated by arrows. There appears to be proteolysis of the protein during purification. Gels were blotted onto nylon membranes and protein detected with an anti V5 monoclonal antibody. (B) Samples from E. coli expression of a maltose-binding protein/His6 dual fusion construct. M indicates the molecular weight markers (Benchmark, LifeTechnologies). The gel was stained with Coomassie Blue by standard procedures. The fusion protein has an apparent molecular weight of 80 kDa.

FIG. 23: illustrates the glycosylation of VEGF-X. VEGF-X was purified from the culture supernatant of COS cells transfected with the pcDNA6/V5-His construct. Supernatants were harvested 72 h post-transfection and purified on nickel resin. Samples were then treated with EndoH (+) or untreated (−) before SDS-PAGE and blotting, as described in the legend to FIG. 22.

FIG. 24: is an illustration of the DNA (SEQ ID NO:123) and polypeptide sequence (SEQ ID NO:124) used for E. coli expression of the VEGF-like domain of VEGF-X. Polypeptide sequences at the N-terminus of the protein derived from the vector are underlined.

FIG. 25: shows expression of the VEGF-X VEGF domain in E. coli. Lane 1-10 μl broad range marker (New England Biolabs), lane 2-10 μl unreduced sample, lane 3-10 μl reduced sample. The reduced PDGF domain protein (lane 3) has an apparent molecular weight of approximately 19 kDa on SDS-PAGE.

FIG. 26: illustrates a DNA (SEQ ID NO:125) and polypeptide sequence (SEQ ID NO:126) used for E. coli expression of the CUB-like domain of VEGF-X. The polypeptide sequence at the N-terminus derived from the vector-encoded signal and the introduced His6 tag are underlined.

FIG. 27: shows expression of the VEGF-X CUB domain in E. coli. The CUB domain protein was purified on Nickel chelate resin. The protein migrates at approximately 23 kDa on SDS-PAGE.

FIG. 28: illustrates the effect of truncated VEGF-X (CUB domain) on HUVEC proliferation. (A) Human Umbilical Vein Endothelial Cells (one-day-treatment). (B) Human Umbilical Vein Endothelial Cells (24-hour starving followed by one-day-treatment). (C) Effect of VEGF-A₁₆₅ and VEGF-X CUB domain on the proliferation of HUVEC (two-day-treatment).

FIG. 29: depicts the tissue distribution of VEGF-X mRNA analysed by Northern blotting and RT-PCR in (A) normal tissues and (B) tumour tissue and cell lines.

FIG. 30: depicts the partial intron/exon structure of the VEGF-X gene. (A) Genomic DNA sequences of 2 exons (SEQ ID NO:127 and SEQ ID NO:128) determined by sequencing; exon sequence is in upper case, intron sequence is in lower case. (B) Shows the location of splice sites within the VEGF-X cDNA sequence (SEQ ID NO:129 and SEQ ID NO:130). The location of mRNA splicing events is indicated by vertical lines. The cryptic splice donor/acceptor site at nt. 998/999 (diagonal lines) gives rise to the splice variant forms of VEGF-X. No splice site information is given for the region shown in italics.

FIG. 31: is a graphic representation of the effect of FL-VEGF-X on HuVEC proliferation: (24 hour serum starvation followed by one day treatment).

FIG. 32: is a graphic representation of the combined effect of truncated VEGF-X (CUB domain) and human recombinant VEGF₁₆₅ on HUVEC proliferation: (24 hour serum starvation followed by two day treatment).

FIG. 33: is a graphic representation of the combined effect of the CUB domain and human recombinant bFGF on NuVEC proliferation: (24 hour serum starvation followed by two day treatment).

FIG. 34: is a graphic representation of the results of a LDH assay fox testing cytotoxicity of the CUB domain or the CUB domain with rhVEGF₁₆₅.

FIG. 35: is a graphic representation of the results obtained from a LDH assay for testing cytotoxicity of the CUB domain or CUB domain with rh-bFGF.

A BLAST (Basic Local Alignment Search Tool; Altschul et al., 1990 J. Mol. Biol. 215, 403-410) search was performed in the proprietary LifeSeq™ human EST database (Incyte Pharmaceuticals, Inc., Palo Alto, Calif., USA). BLAST produces alignments of both nucleotide and amino acid sequences to determine sequence similarity. Because of the local nature of the alignments, BLAST is especially useful in determining exact matches or in identifying homologues. While it is useful for matches which do not contain gaps, it is inappropriate for performing motif-style searching. The fundamental unit of BLAST algorithm output is the High-scoring Segment Pair (HSP).

Eighteen human EST clones (FIG. 14) with high similarity to the previously identified VEGF proteins were identified and a further fifty EST clones (FIG. 15) were identified using these sequences as query sequences, allowing us to deduce the putative sequence for the new VEGF-X protein. The sequences obtained were compared to known sequences to determine regions of homology and to identify the sequence as a novel VEGF-type protein. Using the DNA sequence information in the databases we were able to prepare suitable primers having the sequences of VEGF-X 1-10 illustrated in FIG. 3 for use in subsequent RACE experiments to obtain the complete DNA sequence for the VEGF-X gene.

Cloning

A profile was developed based on the VEGF-like domain. in existing VEGF sequences (VEGF-A, B, C and D). This was used to search the public databases and the incyte LifeSeq™ database. No significant novel matching sequences were found in the public databases. All of the matching sequences found in the LifeSeq™ database (˜1000) were assembled to give a smaller number of sequences (˜30), which included the known VEGFs and a potential novel VEGF (FIGS. 1 and 2). This sequence was named VEGF-X.

Oligonucleotides were designed to amplify the VEGF-X sequence from cDNA (FIG. 3). The ESTs found in LifeSeq™ were from a range of tissues, with a slight predominance of sequences from ovary, testis, placenta and lung (FIGS. 14 and 15). Accordingly the oligonucleotides were used to amplify cDNA derived from lung and placenta. First-round PCR products were found at ˜200 bp larger than the expected sizes, while 3 major species appeared after a second round of PCR amplification, the smallest of which was of the expected size. These fragments were cloned and sequenced. The smallest fragment did indeed have the sequence originally identified from the LifeSeq database, while the others contained insertions (FIG. 4).

As the first round of amplification suggested that the major species found in cDNA from ovary and placenta was not that originally identified in the LifeSeq™ database, the focus of effort was switched to the presumed major species (it seemed likely that clones 57, 25-27 and 2.1 kb clones 1-3 in FIG. 4 represented the major mRNA species). Conceptual translation of the DNA sequences of these cloned PCR fragments indicated that the complete open reading frame was not present in the clones or in the sequence from LifeSeq™. While all clones contained the same sequence in the region of the translation termination codon, indicating that the end of the open reading frame had been identified, the 5′ end of the open reading frame had not been cloned. 5′ RACE experiments were therefore carried out in order to find the start of the reading frame. PCR primers designed for RACE experiments are shown in FIG. 5. RACE PCR products were sequenced directly. Sequence could be obtained from the 3′ end of these RACE products but not from the 5′ end; probably because the products were not cloned and were therefore heterogeneous at the 5′ end. This new sequence was assembled with the existing cloned sequence to give the sequence shown in FIG. 6. Searching the LifeSeq™ database with this sequence identifies ESTs which extend the sequence a further 140 bp in the 5′ direction and a further 160 bp in the 3′ direction (FIG. 7). This longer contig was used to design oligonucleotide primers to amplify the entire coding sequence (these primer sequences are shown in FIG. 8). PCR was carried out using primers 5′-1 and vegfX10 (in order to clone a “full-length” cDNA), and with primers 5′-1 and vegfX6 (in order to clone the full coding region, see FIG. 3 for sequences of vegfX10 and vegfX6). A number of clones were obtained for the shorter fragment, of which clones 4 and 7 contain no PCR errors (sequence of clones 4 & 7 in FIG. 9). A single clone was obtained for the longer fragment (clone 9), but this sequence appears to contain 2 PCR errors.

The predicted polypeptide from these longer contigs is shown in FIG. 10. Amino acids 1-22 are predicted to encode a signal sequence (von Heijne, 1986, Nucleic Acids Res. 14, 4683-4690). FIG. 11 shows an alignment of the protein sequence with VEGFs A-D. The region homologous to the other VEGFs is located towards the C-terminus of the protein. As the VEGF homology domain is expected to belong to the TGF-beta superfamily of growth factors and to consist of a dimer containing both intra- and intermolecular disulphide bonds, initial alignments focussed on the cysteines. However, mapping of the sequence onto the known x-ray structure of the VEGF-A receptor-binding domain (Muller et al (1997) Proc. Natl. Acad. Sci USA 94, 7192-7197) suggests that the alignment in FIG. 11 is plausible, as the extra 4 cysteine residues within the VEGF-homology region of VEGF-X (compared to this region of VEGF-A) correspond to residues which are spatially close in VEGF-A, and may therefore be able to form disulphide bonds.

A search of the PFAM database of protein domains with the full-length polypeptide sequence from FIG. 10 identifies two domain consensus sequences within the polypeptide. The more C-terminal domain is a “VEGF” domain: (the known VEGFs all contain this domain and the structure of this region of VEGF-A is similar to that of PDGF). Additionally towards the N-terminus of the polypeptide there is a CUB domain (amino acids ˜40-150). The CUB domain is a 100-110 amino acid extracellular domain found in a number of developmentally-regulated proteins. When the full-length protein is used to search the protein databases using the BLAST 2 algorithm, the scores for matches to CUB domain-containing proteins are more significant than those to the other VEGFs. Interestingly, the most significant matches are to the CUB domains of Neuropilins, and Neuropilin-1 was recently identified as a receptor of one of the VEGF-A isoforms VEGF-A₁₆₅ (Soker et al. (1998) Cell 92, 735-795).

Assuming that the variant sequences isolated by PCR (i.e. the smaller PCR fragments) use the same translation initiation site as the full-length sequence, they would result in production of the variant proteins shown in FIG. 12. It may be significant that both of these variant proteins retain the CUB domain and delete all or part of the VEGF-like domain. The production of these variant sequences can be explained by the use of a cryptic splice donor/acceptor site within the VEGF-X sequence (FIG. 30B, between nt. 998/999): one variant arises by splicing out of the region between nt. 729-998, the other by splicing out of the region between nt. 999-1187.

Expression

Full-length Expression Constructs

Mammalian Cells

Clone 4 containing the full CDS of VEGF-X (see FIG. 9), was used to generate constructs for expression of full-length protein. The sequence was amplified by PCR and cloned into the vector pCDNA6/V5-His so as to add a C-terminal V5 epitope tag and His₆ tag. The DNA and polypeptide sequence in this vector is shown in FIG. 19. Transient expression in COS cells followed by western blotting and detection via an anti-V5 mAb demonstrates the secretion of a protein of ˜50K into the medium in transfected cells only (FIG. 22A). This construct can also be used to generate VEGF-X expressing stable CHO cell lines.

Baculovirus/Insect-cell Expression System

For expression in the baculovirus/insect cell system the DNA encoding the predicted mature VEGF-X polypeptide sequence was fused to a sequence encoding a signal derived from melittin, a secreted insect protein. An N-terminal 6His tag was also added to facilitate purification. The insert was then cloned into the baculovirus expression vector pFASTBAC. The DNA and polypeptide sequence of this construct is shown in FIG. 20. Infection of Trichoplusia ni Hi5 cells with this recombinant baculovirus results in the secretion of a protein of approximately 45K into the medium (data not shown).

E.coli.

The coding region of VEGF-X has been cloned in a variety of ways for expression as a secreted protein in E.coli. A particularly useful expression clone carries an N-terminal fusion to the E.coli maltose-binding protein (MBP-derived from the expression vector pMAL-p2, New England Biolabs) and a C-terminal fusion to a 6His tag. The DNA and polypeptide sequence of this vector is shown in FIG. 21. Sequential purification of cell fractions on NI-NTA resin and amylose resin allows the isolation of the expressed protein (see FIG. 22B).

Expression of Fragments

VFGF

The VEGF domain of VEGF-X has beer expressed in E.coli. Similar domains from VEGF-A (Christinger et al. (1996) PROTEINS: Structure, Function and Genetics 26, 353-357), and VEGF-D (Achen et al (1998) Proc. Natl. Acad. Sci USA 95, 548-553) have been shown to be capable of binding to the respective receptors.

Expression of these domains was carried out using the bacterium E.coli. Additionally, the full-length protein was expressed using the baculovirus/insect cell expression system. The oligonucleotide primers which have been obtained for these experiments are shown in FIG. 13. The construct directed expression in the bacterial cytoplasm, and as expected the protein was produced in insoluble form in inclusion bodies (the DNA and polypeptide sequence used for PDGF domain expression is shown in FIG. 24). Inclusion bodies were washed, solubilized with urea and the protein purified under denaturing conditions, before refolding by dialysis to remove the urea. Soluble protein was obtained, but shows little evidence of the disulphide bond linked dimers seen with material derived from animal cells (FIG. 25, compare with FIGS. 22A & B). It is not clear therefore whether this protein is correctly folded.

CUB

The CUB domain has been expressed as a soluble secreted protein in E.coli (FIG. 26). The protein was purified by binding to Ni-NTA resin (FIG. 27) and assayed for activity on HUVECs in an in-vitro proliferation assay.

Properties of the VEGF-X Protein

The transient mammalian cell expression system described above has been used to generate full-length VEGF-X protein, as shown by antibody detection following Western blotting (see FIG. 22A).

Disulphide Bond Linked Dimers

The other members of the PDGF family of growth factors, the PDGFs and VEGFs, all exist as dimers in which two monomers constituting the dimer are linked by interchain disulphide bonds. The x-ray structures of PDGF-BB (Oefner et al, 1992), and VEGF-A (Muller et al, 1997) are known and indicate that at least these two members of the family contain two interchain disulphide bonds. Practically this means that in SDS-PAGE analysis of these growth factors the presence of interchain disulphide bonds is shown by a large decrease in mobility in the absence of reducing agent (ie. the nonreduced dimer migrates more slowly through the gel than the reduced monomer). This effect was also expected for VEGF-X, and has been demonstrated for the material obtained from transient mammalian cell expression (FIG. 22A). In the case of the full length material produced in E.coll only some 10% of the total VEGF-X protein appears to be present as disulphide bond-linked dimers (FIG. 22B). However, these results provide evidence that the mammalian cell-derived protein is correctly folded, and that a portion of the E.coli-derived protein is too.

Glycosylation

There are 3 predicted potential N-linked glycosylation sites within the VEGF-X protein: at residues 25, 55 and 254 of the polypeptide sequence. The predicted molecular mass of the mature VEGF-X protein is 40 kDa, but SDS-PAGE and western blotting (detection via an introduced C-terminal epitope tag—see FIG. 19) of the full-length protein expressed in COS cells gives a band slightly larger than the expected size (45-50 kDa) as well as one at 25 kDa (FIG. 22A). This smaller band is presumed to be a C-terminal proteolysis fragment derived from the full-length molecule (controls from uninfected cells do not show this band), probably corresponding to a cleavage between the CUB and VEGF domains. EndoH treatment of the preparation gives a slight mobility change for the full-length protein (FIG. 23), but for the smaller VEGF domain fragment there is a clear change, indicating that the predicted glycosylation-site within the VEGF domain at residue 254 is indeed glycosylated;

Activity of Proteins in Cell-based Assays

Protein samples were tested for activity in cell proliferation, cell migration and in-vitro angiogenesis assays. Active samples can also be tested in the in vivo matrigel mouse model of angiogenesis.

Full-length VEGF-X Protein

Conditioned medium derived from COS cells transiently expressing VEGF-X (see FIG. 22A) displayed no detectable activity in any of the assays. However, as VEGF-X protein could only be detected in this preparation by Western blotting, and not by Coomassie-staining of gels, it is clearly present at very low levels and this may be the reason for the observed lack of activity in the cell proliferation, migration or in vitro angiogenesis tests.

VEGF Domain

The VEGF domain protein described above has been tested in cell proliferation (on a range of cell types), cell migration and in vitro angiogenesis assays and has failed to show activity in any of these tests. As suggested above, this may be due to incorrect folding of this protein.

CUB Domain

The CUB domain protein at the highest dose tested (1 μg/ml) appears to inhibit proliferation of HUVECs in the absence of other stimulation (FIGS. 28A & B). This effect is also seen following stimulation with the lowest VEGF-A₁₆₅ dose tested (1 ng/ml-FIG. 28C). The CUB domain of VEGF-X therefore appears to show antiproliferative activity on HUVECs, even in the presence of low VEGF-A₁₆₅ doses.

Tissue Distribution of mRNA

VEGF-A mRNA expression has been shown to be upregulated in a wide variety of human tumors (lung, breast, ovarian, colon, stomach, liver, pancreas, kidney, bladder and prostate-Takahashi et al, 1995). Tumor VEGF-A expression has been shown to correlate with tumor growth rate, microvascular density and tumor metastasis (Takahashi et al, 1995). It was thus of interest to examine the mRNA expression patterns of VEGF-X. Accordingly, Northern blot analysis of mRNA derived from different tissues has been carried out. The results indicate that although the VEGF-X mRNA is expressed at low levels, it is present in a wide range of tissues. PCR amplification of cDNA from a range of tissue sources supports this idea (FIG. 29A). The major mRNA species is approximately 3.1 kb in size. There is no significant upregulation seen in tumour cell lines or in tumour tissues tested (FIG. 29B), with the possible exception of the cell lines GI-117 (lung carcinoma) and SaOS-2 (osteosarcoma). The results of these initial tissue distribution studies do not, therefore, provide evidence for upregulation of VEGF-X in tumour growth, as is seen with VEGF-A.

Genomic Structure of the VEGF-X Gene

A genomic BAC clone covering the 3′ part of the VEGF-X locus was isolated by hybridisation screening of nylon filters containing a human BAC library. Direct sequencing of this clone using oligonucleotide primers based on the VEGF-X cDNA sequence allowed the determination of several intron/exon boundaries (FIG. 30). Interestingly, the position of the mRNA splice site within the PDGF domain (nt 1187/1188 in FIG. 30B) is conserved with respect to those in the VEGF-A and VEGF-D genes (Tischer et al, 1991; Rocchigiani et al, 1998).

Materials & Methods

PCR, Cloning, DNA Sequence Determination and BAC Screening.

All primers were purchased from Eurogentec, Seraing, Belgium. Insert-specific sequencing primers (15- and 16-mers) were designed by visual inspection of the DNA sequences. DNA was prepared on Qiagen-tip-20 columns or on Qiaquick spin columns (Qiagen GmbH, Dcisseldorf, Germany) and recovered from the spin columns in 30 μl Tris/EDTA-buffer (10 mM TrisHCl pH 7.5, 1 mM EDTA (sodium salt)). Sequencing reactions were performed using BigDye™ Terminator Cycle Sequencing Ready Reaction kits (Perkin Elmer, ABI Division, Foster City, Calif., USA) and were run on an Applied Biosystems 377 DNA sequencer (Perkin Elmer, ABI Division, Foster City, Calif., USA). Polymerase chain reactions were carried out according to standard procedures (Ausubel et al, 1997). The PCR fragments were cloned into vectors pCR2.1 (Invitrogen, Carlsbad, Calif. USA) or pCR-TOPO (Invitrogen,NL) according to the manufacturer's instructions. One of those vectors, plasmid VEGFX/pCR2.1 1TOPO FL was deposited on 1 Mar. 1999 under Accession No. LMBP 3925. After sequence determination, the inserts were cloned into the desired expression vectors (see FIGS. 19, 20, 21, 24 & 26)

A human genomic RAC library (Genome Systems, Inc., St Louis, Mich., USA) was screened by hybridisation to oligonucleotides derived from the VEGF-X cDNA sequence, according to the manufacturer's instructions. BAC DNA was prepared using a Qiagen plasmid midi kit (Qiagen GmbH, Dusseldorf, Germany according to the manufacturer's instructions with some modifications (after clearing of the lysate from chromosomal DNA, supernatants from individual preparations were pooled on a single column (tip 100), and after the 70% EtOH wash, the pellet was resuspended overnight at 4° C. in 100 μl TE). 20-mer sequencing primers were designed based on the known cDNA sequence, and sequencing carried out as above.

5′ RACE

In order to extend the cDNA clone in a 5′ direction RACE reactions were carried out. Since it was known that the mRNA is present in placenta and skeletal muscle, Marathon-Ready™ placenta and skeletal muscle cDNAs were purchased from Clontech (Palo Alto Calif. USA) and used according to the manufacturer's instructions. DNA fragments were excised from agarose gels, purified using QiaQuick PCR purification columns (Qiagen GmbH, Dusseldorf, Germany) and sequenced directly.

VEGF-X protein expression and purification DNA fragments encoding the desired protein sequences were amplified by PCR and cloned into appropriate expression vector systems.

For mammalian cell expression, the full coding sequence was cloned into the vector pcDNA6/V5-his (Invitrogen Leek, NL, see FIG. 19 for construct sequence), so as to add a C-terminal peptide tag to assist in detection and purification.

For insect cell expression the sequence of the predicted mature polypeptide was initially amplified to add an N-terminal 6His peptide and then cloned into the pMelBacB vector (Invitrogen, Leek, NL) to add an insect cell signal sequence. The entire insert was then PCR-cloned into the vector pFASTBAC-1 (LifeTechnologies, Gaithersburg, Mass., USA) for construction of a baculovirus according to the manufacturer's instructions.

For E. coli expression, the coding region was ECR .amplified to add a C-terminal 6His tag and then cloned into the vector pMAL-p2 (New England Biolabs, Beverly, Mass., USA). The coding sequence of this construct is shown in FIG. 21). The protein was purified first on Ni-NTA resin (Qiagen GmbH, Dusseldorf, Germany) and then on amylose resin (New England Biolabs, Beverly, Mass., USA), according to the manufacturers' instructions.

DNA sequences encoding the CUB and VEGF domain fragments of VEGF-X were PCR amplified and cloned into pET22b and pET21a (Novagen, Madison, Wis., USA) respectively. The CUB domain protein was prepared either from the periplasm or medium of induced cultures by standard methods (Ausubel et al, 1997). The protein was initially purified by precipitation with 20% ammonium sulphate. After overnight dialysis vs. 20 mM Tris Hcl pH7.5, 100 mM NaCl to remove ammonium sulphate, the protein was further purified on Ni-NTA resin as described above. The VEGF domain protein was expressed in insoluble form, and preparation of inclusion bodies was carried out using standard procedures (Ausubel et al 1997). Inclusion bodies were dissolved in 6M guanidine hydrochloride, 20 mM Tris Hcl pH8.0, 200 mM NaCl, 1 mM 2-mercaptoethanol, and purified on Ni-NTA resin (Qiagen GmbH, Düsseldorf, Germany) according to the manufacturer's instructions. The protein was refolded by dialysis against several changes of buffer containing decreasing concentrations of denaturant.

Analysis of protein glycosylation was carried out using EndoH (Roche Molecular Biochemicals, Brussels, BE) according to the manufacturer's instructions.

Cell Proliferation Assay

Human umbilical vein endothelial cells (HUVECs) (Clonetics, San Diego, Calif.) were trypsinized with 0.05% trypsin/0.53 mM EDTA (Gibco, Gaithersburg, Md.), resuspended in the EGM-2(Clonetics, San Diego, Calif.), counted, and distributed in a 96-well tissue culture plate at 5,000 cells/well. Following cell attachment and monolayer formation (16 hours), cells were stimulated with various concentrations of truncated VEGF-X (CUB domain or VEGF domain) or dilutions of culture supernatants of the full-length VEGF-X (COS 7 or HEK293) in DMEM (Gibco, Gaithersburg, Md.) containing 0.5% to 2% FBS (HyClone, Logan, Utah) as indicated. For human fetal dermal fibroblasts (American Type Culture Collection, Rockville, Md.), the growth medium was replaced by DMEM containing 0.1% BSA (Sigma, St. Louise, Mo.) with or without various concentrations of truncated VEGF-K proteins. For HCASMC (Clonetics, San Diego, Calif.), the medium was replaced by DMEM containing 0.5% FBS. The cells were treated for a further 24 hr-72 hr. For the measurement of proliferation, the culture media were replaced with 100 μl of DMEM containing 5% FBS and 3 μCi/ml of [3H]-thymidine (Amersham, Arlington Heights, Ill.). Following pulse labeling, cells were fixed with methanol/acetic acid (3:1, vol/vol) for 1 hour at room temperature. The cells were washed twice with 250 μl/well of 80% methanol. The cells were solubilized in 0.05% trypsin (100 μl/well) for 30 minutes then in 0.5% SDS (100 μl/well) for another 30 minutes. Aliquots of cell lysates (180 μl) were combined with 2 ml of scintillation cocktail (Fisher, Springfiled, N.J.) and the radioactivity of cell lysates was measured using a liquid scintillation counter (Wallac 1409). In each case, samples were performed in quadruplicate.

Chemotaxis Assay

The chemotactic response of HUVECs was assayed using a 48-well modified Boyden chamber (NeuroProbe, Cabin John, Md.) and collagen-coated (0.1 mg/ml type I collagen, Collaboratic Biomedical, Bedford, Mass.) polycarbonate membrane filters with a pore diameter of 8 μm (NeuroProbe, Cabin John, Md.). Cell suspensions (15,000/well) were loaded to the upper part of the chemotaxis chamber and stimulated for 4 hours with rhVEGF₁₆₅, (0.1-10 ng/ml) (Calbiochem, San Diego, Calif.) or various concentrations of truncated VEGF-X (PDGF domain). Cells remaining on the top of the membrane were removed. Migration was assessed by counting the number of cells that migrated to the lower side of the filter membrane. The membrane was fixed with 10% formaldehyde for 15 min, followed by staining with Gill's hemotoxylin III (Poly Scientific, Bay Shore, N.Y.). The assay was performed in triplicates and six independent high power fields per well were counted using a light microscope at 250 magnification. The results were expressed as the fold of unstimulated cells (EGM containing 0.1% BSA).

In Vitro Angiogenesis Assay

In vitro angiogenesis in fibrin gels was quantitated using spheroids of human umbilical vein endothelial cells (Korff et al., 1998). To generate endothelial cell spheroids of defined size and cell number, a specific number of cells (˜800 cells per spheroid) was suspended in EGM-2 culture medium containing 20% methylcellulpse (Sigma, St. Louis, Mo.), seeded into nonadherent round-bottom 96-well plates. All suspended cells in one well contributed to the formation of a single endothelial cell spheroid within 24 hours. A fibrin gel stock solution was prepared freshly prior to use by mixing 3 mg/ml fibrinogen (Calbiochem, San Diego, Calif.) in Medium 199(Gibco, Gaithersburg, Md.). Assays were performed in 24-well culture plates. The 1 ml fibrinogen stock was mixed with 50 HUVEC spheroids and the corresponding test substance including rh-VEGF₁₆₅ or various concentration of VEGF-X. The spheriod-containing fibrinogen was rapidly transferred into 24-well plates. Fifteen microliters of thrombin (100 NIH U/ml stock, Sigma, St. Louis, Mo.) was added to the gel for the fibrin gel formation. The gel formation usually occurred within 30 seconds. After gel formation, 1 m/well of Medium 199 supplemented with 20% FBS, 1 mg/ml ε-aminocaproic acid (Calbiochem, San Diego, Calif.) and antibiotics were added. The gel was incubated at 37° C. (5% CO₂, 954% air, 00% humidity). After 3 days, in vitro angiogenesis was quantitated by measuring the length of the three longest capillary sprouts that had grown out of each spheroid (100× magnification), analyzing at least 10 spheroids per experimental group and experiment.

Matrigel House Assay

The matrigel mouse assay is carried out as described by Passanti et al (1992).

Analysis of VEGF-X Gene Expression by RT-PCR Analysis.

Oligonucleotide primers VEGF-E2 and VEGF-X14 (FIG. 16; FIG. 5) were used for the specific PCR amplification of a 350 bp fragment from VEGF-X. PCR amplifications were performed on human multiple tissue cDNA (MTC™) panels (Clontech human MTC panels I and II and human Tumor MTC panel) normalised to the mRNA expression levels of six different housekeeping genes. In addition, cDNA was made from different tumor cell cultures (Caco-2 colorectal adenocarcinoma; T-84 colorectal carcinoma; MCF-7 breast adenocarcinoma; T-47D breast ductal gland carcinoma; HT1080 bone fibrosarcoma; SaOS-2 osteosarcoma; SK-N-MC neuroblastoma; HepG2 hepatoblastoma; JURKAT T-cell leukemia and THP-1 myelomonocytic leukemia). For the preparation of tumor cell cDNA, cells were homogenised and total RNA prepared using the RNeasy Mini kit (Qiagen GmbH, Hilden, Germany) according to manufacturer's instructions. 1 μg of total RNA was reverse transcribed using oligo(dT)15 as a primer and 50 U of Expand™ Reverse Transcriptase (Boehringer Mannheim, Mannheim, Germany) according to the manufacturer's instructions. FCR reactions with VEGF-X-specific or qlyceraldehyde-3-phosphate dehydrogenase (G3PDH)-specific primers were then performed on 1 μl of this cDNA. For all cDNAs, PCR reactions with VEGF-X specific primers were performed in a total volume of 50 μl, containing 5 μl (±1 ng) of cDNA, 1×Advantage KlenTaq PCR reaction buffer, 0.2 mM dNTP, 250 nM of primers VEGF-E2 and VEGF-Xl4 and 1 μl of Advantage KlenTaq polymerase mix. Samples were heated to 95° C. for 30 s and cycling was done for 30 s at 95° C. and 30 s at 68° C. for 25, 30 or 35 cycles. Control reactions using specific primers that amplify a 1 kb fragment of the housekeeping gene G3PDH were also performed according to the manufacturer's instructions.

Northern Blot Analysis of VEGF-X.

Northern blots containing 2 μg of poly(A)-rich RNA derived from different human tissues (Clontech Laboratories; MTN™ blot, MTN™ blot II and Cancer Cell Line MTN™ blot) were hybridized according to the manufacturers instructions with a α-[³²P]-dCTP random-priming labelled (Multiprime labelling kit, Roche Diagnostics) 293 bp specific VEGF-X fragment (PinAl-Stul fragment including 92 bp of the 3′ end coding region and 201 bp of the 3′ untranslated region of VEGF-X). The blots were hybridized overnight at 68° C. and final washes at high stringency were at 68° C. in 0.1× SSC/0.1% SDS. The membranes were autoradiographed for 1 to 3 days with intensifying screens.

Full Length VEGF-X

The effect of full length VEGF-X on proliferation of HUVEC cells was determined by the ³H-Thymidine incorporation assay. HuVEC cells were serum starved for 24 hours prior to treatment with the full length VEGF-x at the concentration range from 100 pg/ml-10 μg/ml. There was no effect of VEGF-X at 100 pg/ml-10 ng/ml on endothelial cell proliferation. At the higher concentrations of FL-VEGF-X (100 ng/ml and 1 μg/ml) there was a marked inhibition of endothelial cell proliferation. This is probably due to the very high endotoxin level in the samples. The VEGF-X sample was purified in order to decrease the endotoxin level and is currently tested in the cell proliferation assay.

The Summary from Testing the CUB Domain

The effect of CUB domain on inhibition of HuVEC prolieration either serum- (2%), rh-VEGF or bFGF-stimulated, was assessed by the ³H-Thymidine incorporation assay. Cells were serum starved followed by the treatment with the CUB domain and various growth factors. Results showed that the CUB domain inhibited endothelial cell proliferation, either serum- (24), rh-VEGF or bFGF-stimulated in a dose dependent manner with maximal inhibition at 10 μg/ml. There was approximanntely a 2-fold inhibition of proliferation (at 10 μg/ml) of cells stimulated with VEGF and bFGF and nearly a 5-fold inhibition of cells stimulated with serum (2%). Results with the LDH assay showed that there was no cytotoxicity associated with the inhibition of cell proliferation by the CUB domain.

Therefore, the N-terminus of the polypeptide from FIG. 10 has been shown to possess a CUB domain. When database searches are carried out using the full-length coding sequence the best matches (i.e. for a BLAST search, those with the lowest probability score) are found with the CUB domain rather than with the VEGF-like domain. The best match from searching release 37 of the SWISSPROT database (February 1999) is to the-CUB domain of a neuropilin from Xenopus laevis, and the matches to the CUB domains of human neuropilins 1 and 2 are also more significant than matches to the VEGFs.

This similarity is provocative, given the identification of neuropilin-1 and -2 as cellular receptors for the VEGF-A 165 (Stoker et al. 1998, reviewed in Neufeld et al. 1999). It is plausible therefore that VEGF-X could exert dual regulatory effects: via interaction with the tyrosine kinase VEGF-receptors mediated by the VEGF-like domain, as well as via interaction with VEGF isoforms or with the neurophilin receptors, mediated by the CUB domain.

To the best of our understanding the latter would be entirely novel, and searches on the-most recent release of the Incyte database do not reveal any other proteins containing both CUB and VEGF-like domains. This arrangement of domains suggests possible positive or negative models of regulation:

Positive- the VEGF-like domain is able to interact productively with the tyrosine kinase VEGF receptors giving activation, and the CUB domain is able to interact productively with the neuropilin receptor giving activation.

Negative- the VEGF-like domain does not interact productively with the tyrosine kinase VEGF receptors, either preventing receptor dimerisation or blocking the VEGF binding sites. Further, the CUB domain does not interact productively with the neuropilin receptors, either preventing receptor activation or blocking the VEGF binding sites, or indeed by binding to VEGF isoforms and preventing their interaction with receptors.

TABLE 1 ORIGINAL RESIDUE EXEMPLARY SUBSTITUTIONS ALA SER, THR ARG LYS ASN HIS, SER ASP GLU, ASN CYS SER GLN ASN, HIS GLU ASP, GLU GLY ALA, SER HIS ASN, GLN ILE LEU, VAL, THR LEU ILE, VAL LYS ARG, GLN, GLU, THR MET LEU ILE, VAL PHE LEU, TYR SER THR, ALA, ASN THR SER, ALA TRP ARG, SER TYR PHE VAL ILE, LEU ALA PRO ALA

References

1. Ausubel, E M, R Brent, R E Kingston, D D Moore, J G Seidman, J A Smith, K Struhl (Eds). (1997) Current Protocols in Molecular Biology, John Wiley and Sons.

2. von Heijne, G. (1986) Nucleic Acids Res. 14, 4683-4690.

3. Muller, Y A, B Li, H W Christinger, J A Wells, B C Cunningham and A M de Vos. (1997) Vascular endothelial growth factor: crystal structure and functional mapping of the kinase domain receptor binding site. Proc. Natl. Acad. Sci USA 94, 7192-7197.

4. Korff, T and Augustic, H. G. (1998) Integration of endothelial cells in multicellular spheroids prevents; apoptosis and induced differentiation. The Journal of Cell Biology. 143, 1341-1352

5. Christinger, H W, Y A Muller, L T Berleau, B A Keyt, B C Cunningham, N Ferrara and A M de Vos. (1996) PROTEINS' Structure, Function and Genetics 26, 353-357.

6. Achen, M G, M Jeltsch, E Kukk, T Makinen, A Vitali, A F Wilks, K Alitalo and S A Stacker. (1998) Proc. Natl. Acad. Sci USA 95, 548-553.

7. Siemeister, G, B Schnurr, K Mohrs, C Schachtele, C Marme and G Martiny-Baron. (1996) Biochem. Blophys. Res. Commun. 222, 249-255.

8. Soker, S, S Takashima, H Q Miao, G Neufeld and M Klagsbrun (1998). Neuropilin-1 is expressed by endothelial and tumor cells as an isoform-specific receptor for vascular endothelial growth factor, Cell 92: 735-745.

9. Neufeld, G, T Cohen, S Gengrinovitch and Z Poltora)k (1999). Vascular endothelial growth factor and its receptors, FASEB J. 13:9-22.

10. Oefner, C., D'Arcy, A., Winkler, F. K., Eggimann, B. and Hosang, M. (1992). Crystal structure of human platelet-derived growth factor BB. EMBO J. 11, 3921-3926.

11. Passanti, A., Taylor, R. M., Pili, R., Guo, Y., Long, P. V., Haney, J. A., Pauly, R., Grant, D. S. and Martin, G. R. (1992) A simple, quantitative method for assessing angiogenesis and antiangiogenic agents using reconstituted basement membrane, heparin and fibroblast growth factor. Laboratory Investigation, 67, 519-528.

12. Rocchigiani, M., Lestingi, M., Luddi, A., Orlandini, M., Franco, B., Rossi, E., Ballabio, A., Zuffairdi, O. and Oliviero, S. (1990). Human FIGF: cloning, gene structure, and mapping to chromosome Xp22.1 between the PIGA and the GRPR genes. Genomics, 47, 207-216.

13. Takahashi, Y., Kitadai, Y., Bucana, C. D., Cleary, K. R. and Ellis, L. M. (1995). Expression of vascular endothelial growth factor and its receptor, KDR, correlates with vascularity, metastasis and proliferation of human colon cancer. Cancer Research, 55: 3964-3968.

14. Tischer, E., Mitchell, R., Hartman, T., Silva, M., Gospodarowicz, D., Fiddes, J. C. and Abraham, J. A. (1991). The human gene for vascular endothelial growth factor: Multiple protein forms are encoded through alternative exon splicing. J. Biol. Chen. 266, 11947-11954.

Sequence Listing

Sequence ID No 1 corresponds to the amino acid sequence from position 23 to 345 of the amino acids sequence illustrated in, FIG. 10.

Sequence ID No 2 is the amino,acid sequence illustrated in FIG. 10.

Sequence ID No 3 corresponds to the sequence from position 257 to 1291 of the nucleotide sequence illustrated in FIG. 9.

Sequence ID No 4 corresponds to the polynucleotide sequence of VEGFX1 illustrated in FIG. 3.

Sequence ID No 5 corresponds to the polynucleotide sequence of VEGFX2 illustrated in FIG. 3.

Sequence ID No 6 corresponds to the polynucleotide sequence of VEGFX3 illustrated in FIG. 3.

Sequence ID No 7 corresponds to the polynucleotide sequence of VEGFX4 illustrated in FIG. 3.

Sequence ID No 8 corresponds to the polynucleotide sequence of VEGFX5 illustrated in FIG. 3.

Sequence ID No 9 corresponds to the polynucleotide sequence of VEGFX6 illustrated in FIG. 3.

Sequence ID No 10 corresponds to the polynucleotide sequence of VEGFX7 illustrated in FIG. 3.

Sequence ID No 11 corresponds to the polynucleotide sequence of VEGFX8 illustrated in FIG. 3.

Sequence ID No 12 corresponds to the polynucleotide sequence of VEGFX9 illustrated in FIG. 3.

Sequence ID No 13 corresponds to the polynucleotide 'sequence of VEGFX10 illustrated in FIG. 3.

Sequence ID No 14 corresponds to the polynucleotide sequence of VEGFX11 illustrated in FIG. 5.

Sequence ID No 15 corresponds to the polynucleotide sequence of VEGFX12 illustrated in FIG. 5.

Sequence ID No 16 corresponds to the polynucleotide sequence of VEGFX 13 illustrated in FIG. 5.

Sequence ID No 17 corresponds to the polynucleotide sequence of VEGFX 14 illustrated in FIG. 5.

Sequence ID No 18 corresponds to the polynucleotide sequence 5′-1 in FIG. 8.

Sequence ID No 19 corresponds to the polynucleotide sequence 5′-2 in FIG. 8.

Sequence ID No 20 corresponds to the polynucleotide sequence of VEGFX6 illustrated in FIG. 13.

Sequence ID No 21 corresponds to the polynucleotide sequence of VEGFX7 illustrated in FIG. 13.

Sequence ID No 22 corresponds to the polynucleotide sequence of VEGFX8 illustrated in FIG. 13.

Sequence ID No 23 corresponds to the polynucleotide sequence of VEGFX9 illustrated in FIG. 13.

Sequence ID No 24 corresponds to the polynucleotide sequence of VEGFX10 illustrated in FIG. 13.

Sequence ID No 25 corresponds to the polynucleotide sequence of VEGBAC2 illustrated in FIG. 13.

Sequence ID No 26 corresponds to a polypeptide having the amino acid sequence from amino acid position 40 to 150 of the sequence of FIG. 10.

Sequence ID No 27 corresponds to a polypeptide having the amino acid sequence illustrated in FIG. 26.

Sequence ID No 28 corresponds to the sequence from position 5 to 508 of the nucleotide sequence illustrated in FIG. 26.

Sequence ID NO: 29 corresponds to the sequence from position 214 to 345 of the amino acid sequence illustrated in FIG. 10.

130 1 323 PRT Homo sapiens 1 Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe Ser Ser Asn Lys Glu Gln 1 5 10 15 Tyr Gly Val Gln Asp Pro Gln His Glu Arg Ile Ile Thr Val Ser Thr 20 25 30 Asn Gly Ser Ile His Ser Pro Arg Phe Pro His Thr Tyr Pro Arg Asn 35 40 45 Thr Val Leu Val Trp Arg Leu Val Ala Val Glu Glu Asn Val Trp Ile 50 55 60 Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu Glu Asp Pro Glu Asp Asp 65 70 75 80 Ile Cys Lys Tyr Asp Phe Val Glu Val Glu Glu Pro Ser Asp Gly Thr 85 90 95 Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr Val Pro Gly Lys Gln Ile 100 105 110 Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe Val Ser Asp Glu Tyr Phe 115 120 125 Pro Ser Glu Pro Gly Phe Cys Ile His Tyr Asn Ile Val Met Pro Gln 130 135 140 Phe Thr Glu Ala Val Ser Pro Ser Val Leu Pro Pro Ser Ala Leu Pro 145 150 155 160 Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala Phe Ser Thr Leu Glu Asp 165 170 175 Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp Gln Leu Asp Leu Glu Asp 180 185 190 Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly Lys Ala Phe Val Phe Gly 195 200 205 Arg Lys Ser Arg Val Val Asp Leu Asn Leu Leu Thr Glu Glu Val Arg 210 215 220 Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg Glu Glu 225 230 235 240 Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu Val Lys 245 250 255 Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn Glu Cys 260 265 270 Gln Cys Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val Leu Gln 275 280 285 Leu Arg Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu Thr Asp 290 295 300 Val Ala Leu Glu His His Glu Glu Cys Asp Cys Val Cys Arg Gly Ser 305 310 315 320 Thr Gly Gly 2 345 PRT Homo sapiens 2 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Tyr Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser 245 250 255 Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro 260 265 270 Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu 275 280 285 His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys 290 295 300 Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu 305 310 315 320 His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp 325 330 335 Cys Val Cys Arg Gly Ser Thr Gly Gly 340 345 3 1035 DNA Homo sapiens 3 atgagcctct tcgggcttct cctgctgaca tctgccctgg ccggccagag acaggggact 60 caggcggaat ccaacctgag tagtaaattc cagttttcca gcaacaagga acagaacgga 120 gtacaagatc ctcagcatga gagaattatt actgtgtcta ctaatggaag tattcacagc 180 ccaaggtttc ctcatactta tccaagaaat acggtcttgg tatggagatt agtagcagta 240 gaggaaaatg tatggataca acttacgttt gatgaaagat ttgggcttga agacccagaa 300 gatgacatat gcaagtatga ttttgtagaa gttgaggaac ccagtgatgg aactatatta 360 gggcgctggt gtggttctgg tactgtacca ggaaaacaga tttctaaagg aaatcaaatt 420 aggataagat ttgtatctga tgaatatttt ccttctgaac cagggttctg catccactac 480 aacattgtca tgccacaatt cacagaagct gtgagtcctt cagtgctacc cccttcagct 540 ttgccactgg acctgcttaa taatgctata actgccttta gtaccttgga agaccttatt 600 cgatatcttg aaccagagag atggcagttg gacttagaag atctatatag gccaacttgg 660 caacttcttg gcaaggcttt tgtttttgga agaaaatcca gagtggtgga tctgaacctt 720 ctaacagagg aggtaagatt atacagctgc acacctcgta acttctcagt gtccataagg 780 gaagaactaa agagaaccga taccattttc tggccaggtt gtctcctggt taaacgctgt 840 ggtgggaact gtgcctgttg tctccacaat tgcaatgaat gtcaatgtgt cccaagcaaa 900 gttactaaaa aataccacga ggtccttcag ttgagaccaa agaccggtgt caggggattg 960 cacaaatcac tcaccgacgt ggccctggag caccatgagg agtgtgactg tgtgtgcaga 1020 gggagcacag gagga 1035 4 22 DNA Artificial Sequence Description of Artificial Sequence primer 4 aaaatgtatg gatacaactt ac 22 5 23 DNA Artificial Sequence Description of Artificial Sequenceprimer 5 gtttgatgaa agatttgggc ttg 23 6 22 DNA Artificial Sequence Description of Artificial Sequence primer 6 tttctaaagg aaatcaaatt ag 22 7 20 DNA Artificial Sequence Description of Artificial Sequence primer 7 gataagattt gtatctgatg 20 8 17 DNA Artificial Sequence Description of Artificial Sequence primer 8 gatgtctcct ctttcag 17 9 18 DNA Artificial Sequence Description of Artificial Sequence primer 9 gcacaactcc taattctg 18 10 18 DNA Artificial Sequence Description of Artificial Sequence primer 10 agcacctgat tccgttgc 18 11 20 DNA Artificial Sequence Description of Artificial Sequence primer 11 tagtacatag aatgttctgg 20 12 19 DNA Artificial Sequence Description of Artificial Sequence primer 12 aagagacata cttctgtac 19 13 21 DNA Artificial Sequence Description of Artificial Sequenceprimer 13 ccaggtacaa taagtgaact g 21 14 28 DNA Artificial Sequence Description of Artificial Sequenceprimer 14 cctttagaaa tctgttttcc tggtacag 28 15 31 DNA Artificial Sequence Description of Artificial Sequenceprimer 15 ggaaaatatt catcagatac aaatcttatc c 31 16 22 DNA Artificial Sequence Description of Artificial Sequenceprimer 16 ggtccagtgg caaagctgaa gg 22 17 29 DNA Artificial Sequence Description of Artificial Sequenceprimer 17 ctggttcaag atatcgaata aggtcttcc 29 18 24 DNA Artificial Sequence Description of Artificial Sequenceprimer 18 tttgtttaaa ccttgggaaa ctgg 24 19 21 DNA Artificial Sequence Description of Artificial Sequenceprimer 19 gtccaggttt tgctttgatc c 21 20 30 DNA Artificial Sequence Description of Artificial Sequenceprimer 20 aattggatcc gagagtggtg gatctgaacc 30 21 30 DNA Artificial Sequence Description of Artificial Sequenceprimer 21 aattggatcc gggaagaaaa tccagagtgg 30 22 40 DNA Artificial Sequence Description of Artificial Sequenceprimer 22 ggttgaattc attatttttt agtaactttg cttgggacac 40 23 31 DNA Artificial Sequence Description of Artificial Sequenceprimer 23 aattgaattc attatcctcc tgtgctccct c 31 24 60 DNA Artificial Sequence Description of Artificial Sequence primer 24 aattggatcc ggagtctcac catcaccacc atcatgaatc caacctgagt agtaaattcc 60 25 34 DNA Artificial Sequence Description of Artificial Sequence primer 25 aattgaattc gctatcctcc tgtgctccct ctgc 34 26 111 PRT Homo sapiens 26 Gly Val Gln Asp Pro Gln His Glu Arg Ile Ile Thr Val Ser Thr Asn 1 5 10 15 Gly Ser Ile His Ser Pro Arg Phe Pro His Thr Tyr Pro Arg Asn Thr 20 25 30 Val Leu Val Trp Arg Leu Val Ala Val Glu Glu Asn Val Trp Ile Gln 35 40 45 Leu Thr Phe Asp Glu Arg Phe Gly Leu Glu Asp Pro Glu Asp Asp Ile 50 55 60 Cys Lys Tyr Asp Phe Val Glu Val Glu Glu Pro Ser Asp Gly Thr Ile 65 70 75 80 Leu Gly Arg Trp Cys Gly Ser Gly Thr Val Pro Gly Lys Gln Ile Ser 85 90 95 Lys Gly Asn Gln Ile Arg Ile Arg Phe Val Ser Asp Glu Tyr Phe 100 105 110 27 168 PRT Homo sapiens 27 Met Ala Met Asp Ile Gly Ile Asn Ser Asp Pro Glu Ser His His His 1 5 10 15 His His His Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe Ser Ser Asn 20 25 30 Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg Ile Ile Thr 35 40 45 Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro His Thr Tyr 50 55 60 Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val Glu Glu Asn 65 70 75 80 Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu Glu Asp Pro 85 90 95 Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu Glu Pro Ser 100 105 110 Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr Val Pro Gly 115 120 125 Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe Val Ser Asp 130 135 140 Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr Asn Ile Val 145 150 155 160 Met Pro Gln Phe Thr Glu Ala Val 165 28 504 DNA Homo sapiens 28 atggccatgg atatcggaat taattcggat ccggagtctc accatcacca ccatcatgaa 60 tccaacctga gtagtaaatt ccagttttcc agcaacaagg aacagaacgg agtacaagat 120 cctcagcatg agagaattat tactgtgtct actaatggaa gtattcacag cccaaggttt 180 cctcatactt atccaagaaa tacggtcttg gtatggagat tagtagcagt agaggaaaat 240 gtatggatac aacttacgtt tgatgaaaga tttgggcttg aagacccaga agatgacata 300 tgcaagtatg attttgtaga agttgaggaa cccagtgatg gaactatatt agggcgctgg 360 tgtggttctg gtactgtacc aggaaaacag atttctaaag gaaatcaaat taggataaga 420 tttgtatctg atgaatattt tccttctgaa ccagggttct gcatccacta caacattgtc 480 atgccacaat tcacagaagc tgtg 504 29 132 PRT Homo sapiens 29 Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly Lys Ala Phe Val Phe 1 5 10 15 Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu Leu Thr Glu Glu Val 20 25 30 Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg Glu 35 40 45 Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu Val 50 55 60 Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn Glu 65 70 75 80 Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val Leu 85 90 95 Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu Thr 100 105 110 Asp Val Ala Leu Glu His His Glu Glu Cys Asp Cys Val Cys Arg Gly 115 120 125 Ser Thr Gly Gly 130 30 300 DNA Artificial Sequence misc_feature (41) (41) n is a, c, g, t, or u 30 cacaaatcac tcaccgacgt ggccctggag caccatgagg ngtgtgactg tgtgtgcaga 60 gggagcacag gaggatagcc gcatcaccac cagcagctct tgcccagagc tgtgcagtgc 120 agtggctgat tctattagag aacgtatgcg ttatctccat ccttaatctc agttgtttgc 180 ttcaaggacc tttcatcttc aggatttaca gtgcattctg aaagaggaga catcaaacag 240 aattaggagt tgtgcaacag ctcttttgag aggaggctaa aggacaggag aanaggtctt 300 31 284 DNA Artificial Sequence Description of Artificial SequenceHuman EST 31 tgcagtgcag tggctgattc tattagagaa cgtatgcgtt atctccatcc ttaatctcag 60 ttgtttgctt caaggacctt tcatcttcag gatttacagt gcattctgaa agaggagaca 120 tcaaacagaa ttaggagttg tgcaacagct cttttgagag gaggcctaaa ggacaggaga 180 aaaggtcttc aatcgtggaa agaaaattaa atgttgtatt aaatagatca ccagctagtt 240 tcagagttac catgtacgta ttccactagc tgggttctgt attt 284 32 275 DNA Artificial Sequence Description of Artificial Sequence Human EST 32 cacgaggtcc ttcagttgag accaaagacc ggtgtcaggg gattgcacaa atcactcacc 60 gacgtggccc tggagcacca tgaggagtgt gactgtgtgt gcagagggag cacaggggga 120 tagccgcatc accaccagca gctcttgccc agagctgtgc agtgcagtgg ctgattctat 180 tagagaacgt atgcgttatc tccatcctta atctcagttg tttgcttcaa ggacctttca 240 tcttcaggat ttacagtgca ttctgaaaga ggaga 275 33 278 DNA Artificial Sequence misc_feature (248) (248) n is a, c, g, t, or u 33 ggaggatagc cgcatcacca ccagcagctc ttgcccagag ctgtgcagtg cagtggctga 60 ttctattaga gaacgtatgc gttatctcca tccttaatct cagttgtttg cttcaaggac 120 ctttcatctt caggatttac agtgcattct gaaagaggag acatcaaaca gaattaggag 180 ttgtgcaaca gctcttttga gaggaggcct aaaggacagg agaaaaggtc ttcaatcgtg 240 gaaagaanat taaatgttgt attaaataga caccagct 278 34 275 DNA Artificial Sequence Description of Artificial Sequence Human EST 34 ggaggatagc cgcatcacca ccagcagctc ttgcccagag ctgtgcagtg cagtggctga 60 ttctattaga gaacgtatgc gttatctcca tccttaatct cagttgtttg cttcaaggac 120 ctttcatctt caggatttac atgcattctg aaagaggaga catcaaacag aattaggagt 180 tgtgcaacag ctcttttgag aggaggccta aaggacagga gaaaaggtct tcaatcgtgg 240 aaagaaaatt aaatgttgta ttaaatagat cacca 275 35 261 DNA Artificial Sequence Description of Artificial Sequence Human EST 35 gagaaccgat accattttct ggccaggttg tctcctggtt aaacgctgtg gtgggaactg 60 tgcctgttgt ctccacaatt gcaatgaatg tcaatgtgtc ccaagcaaag ttactaaaaa 120 ataccacgag gtccttcagt tgagaccaaa gaccggtgtc aggggattgc acaaatcact 180 caccgacgtg gccctggagc accatgagga gtgtgactgt gtgtgcagag ggagcacagg 240 aggatagccg catcaccacc a 261 36 279 DNA Artificial Sequence Description of Artificial Sequence Human EST 36 agaaaatcca gagtggtgga tctgaacctt ctaacagagg aggtaagatt atacagctgc 60 acacctcgta acttctcagt gtccataagg gaagaactaa agagaaccga taccattttc 120 tggccaggtt gtctcctggt taaacgctgt ggtgggaact gtgcctgttg tctccacaat 180 tgcaatgaat gtcaatgtgt cccaagcaaa gttactaaaa aataccacga ggtccttcag 240 ttgagaccaa agaccggtgt caggggattg cacaaatca 279 37 262 DNA Artificial Sequence Description of Artificial Sequence Human EST 37 aggaaatcaa attaggataa gatttgtatc tgatgaatat tttccttctg aaccttctaa 60 cagaggaggt aagattatac agctgcacac ctcgtaactt ctcagtgtcc ataagggaag 120 aactaaagag aaccgatacc attttctggc caggttgtct cctggttaaa cgctgtggtg 180 ggaactgtgc ctgttgtctc ccacaattgc aatgaatgtc aatgtgtccc aagcaaagtt 240 actaaaaaat accacgaggt cc 262 38 289 DNA Artificial Sequence misc_feature (35) (35) n is a, c, g, t, or u 38 atttcatctt caggatttac agtgcattct gaaanaggag aaatcaaaca naattaggag 60 ttgtgcaaca gctcttttga gaggaggcct aaaggacagg agaaaaggtc ttcaatcgtg 120 gaaanaaaat taaatgttgt attaaataga tcaccagcta gtttcagagt taccatgtac 180 gtattccact agctgggttc tgtatttcag ttctttcgat acggcttagg gtaatgtcag 240 tacaggaaaa aaactgtgca agtgagcacc tgattccgtt gccttgctt 289 39 245 DNA Artificial Sequence Description of Artificial Sequence Human EST 39 caaagttact aaaaaatacc acgaggtcct tcagttgaga ccaaagaccg gtgtcagggg 60 attgcacaaa tcactcaccg acgtggccct ggagcaccat gaggagtgtg actgtgtgtg 120 cagagggagc acaggaggat agccgcatca ccaccagcag ctcttgccca gagctgtgca 180 gtgcagtggc tgattctatt agagaacgta tgcgttatct ccatccttaa tctcagttgt 240 ttgct 245 40 247 DNA Artificial Sequence misc_feature (2) (2) n is a, c, g, t, or u 40 angagttgcc cagagctgtg cagtgcagtg gctgattcta ttagagaacg tatgcgttat 60 ctccatcctt aatctcagtt gtttgnttca aggacctttc atcttcagga tttacagtgc 120 attctgaaag aggagacatc aaacagaatt aggagttgtg caacagctct tttgagagga 180 ggcctaaagg ncaggagaaa aggtcttcaa tcgtggaaag aaaattaaat gttgtattaa 240 atagatc 247 41 232 DNA Artificial Sequence Description of Artificial Sequence Human EST 41 aggaaatcaa attaggataa gatttgtatc tgatgaatat tttccttctg aaccttctaa 60 cagaggaggt aagattatac agctgcacac ctcgtaactt ctcagtgtcc ataagggaag 120 aactaaagag aaccgatacc attttctggc caggttgtct cctggttaaa cgctgtggtg 180 ggaactgtgc ctgttgtctc cacaattgca atgaatgtca atgtgtccca ag 232 42 253 DNA Artificial Sequence Description of Artificial Sequence Human EST 42 gtgcattctg aaagaggaga catcaaacag aattaggagt tgtgcaacag ctcttttgag 60 aggaggccta aaggacagga gaaaaggtct tcaatcgtgg aaagaaaatt aaatgttgta 120 ttaaatagat caccagctag tttcagagtt accatgtacg tattccacta gctgggttct 180 gtatttcagt tctttcgata cggcttaggg taatgtcagt acaggaaaaa aactgtgcaa 240 gtgagcacct gat 253 43 265 DNA Artificial Sequence misc_feature (238) (238) n is a, c, g, t, or u 43 tgcaacagct cttttgagag gaggcctaaa ggacaggaga aaaggtcttc aatcgtggaa 60 agaaaattaa atgttgtatt aaatagatca ccagctagtt tcagagttac catgtacgta 120 ttccactagc tgggttctgt atttcagttc tttcgatacg gcttagggta atgtcagtac 180 aggaaaaaaa ctgtgcaagt gagcacctga ttccgttgcc ttgcttaacc ctaaagcncc 240 atgtcnnggg cnaaaancga aaaat 265 44 291 DNA Artificial Sequence misc_feature (61) (61) n is a, c, g, t, or u 44 ccttaatctc agttgtttgc ttcaaggacc tttcatcttc aggatttaca gtgcattctg 60 naagangaga catcaaacag aattaggngt tgtgcaaaag ctcttttgag aggaggccta 120 aaggacagga gaaaaggtct ncaatcgtgg aaagnaaatt aaatgttgta tnaaatngat 180 caccagctag tttcagagtt accatgtacg tattccacta gctgggncng tattcagtct 240 ttcggaacgg cttagggtaa tgtcagtaca gganaaaaac tgtgcagtga g 291 45 279 DNA Artificial Sequence misc_feature (205) (205) n is a, c, g, t, or u 45 attaaataga tcaccagcta gtttcagagt taccatgtac gtattccact agctgggttc 60 tgtatttcag ttctttcgat acggcttagg gtaatgtcag tacaggaaaa aaactgtgca 120 agtgagcacc tgattccgtt gccttggctt aactctaaag ctccatgtcc tgggcctaaa 180 atcgtataaa atctggattt ttttnttttt ttttgcgcat attcacatat gtaaaccagn 240 acattctatg tacnacaaac ctggttttta aaaaggaac 279 46 181 DNA Artificial Sequence Description of Artificial Sequence Human EST 46 ggctagtttc agagttacca tgtacgtatt ccactagctg ggttctgtat ttcagttctt 60 tcgatacggc ttagggtaat gtcagtacag gaaaaaaact gtgcaagtga gcacctgatt 120 ccgttgcctt gcttaactct aaagctccat gtcctgggcc taaaatcgta taaaatctgg 180 a 181 47 184 DNA Artificial Sequence misc_feature (54) (54) n is a, c, g, t, or u 47 aatagatcac cagctagttt cagagttacc atgtacgtat tccactagct gggntctgta 60 tttcagttcc tttcgatacg gcttagggta atgtcagtac aggaaaaaag ctgtgcaagt 120 gagcacctga ttccgttgcc ttgcttaact ctaaagctcc atgtcctggg cctaaaatcg 180 tata 184 48 290 DNA Artificial Sequence Description of Artificial Sequence Human EST 48 aaaggaacta tgttgctatg aattaaactt gtgtcgtgct gataggacag actggatttt 60 tcatatttct tattaaaatt tctgccattt agaagaagag aactacattc atggtttgga 120 agagataaac ctgaaaagaa gagtggcctt atcttcactt tatcgataag tcagtttatt 180 tgtttcattg tgtacatttt tatattctcc ttttgacatt ataactgttg gcttttctaa 240 tcttgttaaa tatatctatt tttaccaaag gtatttaata ttctttttta 290 49 300 DNA Artificial Sequence misc_feature (41) (41) n is a, c, g, t, or u 49 cacaaatcac tcaccgacgt ggccctggag caccatgagg ngtgtgactg tgtgtgcaga 60 gggagcacag gaggatagcc gcatcaccac cagcagctct tgcccagagc tgtgcagtgc 120 agtggctgat tctattagag aacgtatgcg ttatctccat ccttaatctc agttgtttgc 180 ttcaaggacc tttcatcttc aggatttaca gtgcattctg aaagaggaga catcaaacag 240 aattaggagt tgtgcaacag ctcttttgag aggaggctaa aggacaggag aanaggtctt 300 50 284 DNA Artificial Sequence Description of Artificial Sequence Human EST 50 tgcagtgcag tggctgattc tattagagaa cgtatgcgtt atctccatcc ttaatctcag 60 ttgtttgctt caaggacctt tcatcttcag gatttacagt gcattctgaa agaggagaca 120 tcaaacagaa ttaggagttg tgcaacagct cttttgagag gaggcctaaa ggacaggaga 180 aaaggtcttc aatcgtggaa agaaaattaa atgttgtatt aaatagatca ccagctagtt 240 tcagagttac catgtacgta ttccactagc tgggttctgt attt 284 51 301 DNA Artificial Sequence misc_feature (47) (47) n is a, c, g, t, or u 51 cttgttaaat atatctattt ttaccaaagg tatttaatat tctttantta tgacaactta 60 gatcaactat ttttagcttg gtaaattttt ctaaacacaa ttgttatagc cagaggaaca 120 aagatgatat aaaatattgt tgctctgaca aaaatacatg tatttcattc tcgtatggtg 180 ctagagttag attaatctgc attttaaaaa actgaattgg aatagaattg gtaagttgca 240 aagacttttt ganaataatt aaattatcat atcttccatt cctgttattg ggggagaaaa 300 t 301 52 275 DNA Artificial Sequence Description of Artificial Sequence Human EST 52 cacgaggtcc ttcagttgag accaaagacc ggtgtcaggg gattgcacaa atcactcacc 60 gacgtggccc tggagcacca tgaggagtgt gactgtgtgt gcagagggag cacaggggga 120 tagccgcatc accaccagca gctcttgccc agagctgtgc agtgcagtgg ctgattctat 180 tagagaacgt atgcgttatc tccatcctta atctcagttg tttgcttcaa ggacctttca 240 tcttcaggat ttacagtgca ttctgaaaga ggaga 275 53 288 DNA Artificial Sequence Description of Artificial Sequence Human EST 53 ttaaaaagga actatgttgc tatgaattaa acttgtgtca tgctgatagg acagactgga 60 tttttcatat ttcttattaa aatttctgcc atttagaaga agagaactac attcatggtt 120 tggaagagat aaacctgaaa agaagagtgg ccttatcttc actttatcga taagtcagtt 180 tatttgtttc attgtgtaca tttttatatt ctccttttga cattataact gttggctttc 240 taatctgtta aatatatcta tttttaccaa aggtatttaa tattcttt 288 54 278 DNA Artificial Sequence Description of Artificial Sequence Human EST 54 ggaggatagc cgcatcacca ccagcagctc ttgcccagag ctgtgcagtg cagtggctga 60 ttctattaga gaacgtatgc gttatctcca tccttaatct cagttgtttg cttcaaggac 120 ctttcatctt caggatttac agtgcattct gaaagaggag acatcaaaca gaattaggag 180 ttgtgcaaca gctcttttga gaggaggcct aaaggacagg agaaaaggtc ttcaatcgtg 240 gaaagaanat taaatgttgt attaaataga caccagct 278 55 275 DNA Artificial Sequence Description of Artificial Sequence Human EST 55 ggaggatagc cgcatcacca ccagcagctc ttgcccagag ctgtgcagtg cagtggctga 60 ttctattaga gaacgtatgc gttatctcca tccttaatct cagttgtttg cttcaaggac 120 ctttcatctt caggatttac atgcattctg aaagaggaga catcaaacag aattaggagt 180 tgtgcaacag ctcttttgag aggaggccta aaggacagga gaaaaggtct tcaatcgtgg 240 aaagaaaatt aaatgttgta ttaaatagat cacca 275 56 261 DNA Artificial Sequence Description of Artificial Sequence Human EST 56 gagaaccgat accattttct ggccaggttg tctcctggtt aaacgctgtg gtgggaactg 60 tgcctgttgt ctccacaatt gcaatgaatg tcaatgtgtc ccaagcaaag ttactaaaaa 120 ataccacgag gtccttcagt tgagaccaaa gaccggtgtc aggggattgc acaaatcact 180 caccgacgtg gccctggagc accatgagga gtgtgactgt gtgtgcagag ggagcacagg 240 aggatagccg catcaccacc a 261 57 279 DNA Artificial Sequence Description of Artificial Sequence Human EST 57 agaaaatcca gagtggtgga tctgaacctt ctaacagagg aggtaagatt atacagctgc 60 acacctcgta acttctcagt gtccataagg gaagaactaa agagaaccga taccattttc 120 tggccaggtt gtctcctggt taaacgctgt ggtgggaact gtgcctgttg tctccacaat 180 tgcaatgaat gtcaatgtgt cccaagcaaa gttactaaaa aataccacga ggtccttcag 240 ttgagaccaa agaccggtgt caggggattg cacaaatca 279 58 259 DNA Artificial Sequence Description of Artificial Sequence Human EST 58 agatgatata aaatattgtt gctctgacaa aaatacatgt atttcattct cgtatggtgc 60 tagagttaga ttaatctgca ttttaaaaaa ctgaattgga atagaattgg taagttgcaa 120 agactttttg aaaataatta aattatcata tcttccattc ctgttattgg agatgaaaat 180 aaaaagcaac ttatgaaagt agacattcag atccagccat tactaaccta ttcctttttt 240 ggggaaatct gagcctagc 259 59 284 DNA Artificial Sequence Description of Artificial Sequence Human EST 59 tttttaaaaa ggaactatgt tgctatgaat taaacttgtg tcgtgctgat aggacagact 60 ggatttttca tatttcttat taaaatttct gccatttaga agaagagaac tacattcatg 120 gtttggaaga gataaacctg aaaagaagag tggcctatct tcactttatc gataagtcag 180 tttatttgtt tcattgtgta catttttata ttctcctttg acatataact gttggctttt 240 ctaatctgtt aaatatatct atttttacca aaggtattta atat 284 60 262 DNA Artificial Sequence Description of Artificial Sequence Human EST 60 aggaaatcaa attaggataa gatttgtatc tgatgaatat tttccttctg aaccttctaa 60 cagaggaggt aagattatac agctgcacac ctcgtaactt ctcagtgtcc ataagggaag 120 aactaaagag aaccgatacc attttctggc caggttgtct cctggttaaa cgctgtggtg 180 ggaactgtgc ctgttgtctc ccacaattgc aatgaatgtc aatgtgtccc aagcaaagtt 240 actaaaaaat accacgaggt cc 262 61 289 DNA Artificial Sequence misc_feature (35) (35) n is a, c, g, t, or u 61 atttcatctt caggatttac agtgcattct gaaanaggag aaatcaaaca naattaggag 60 ttgtgcaaca gctcttttga gaggaggcct aaaggacagg agaaaaggtc ttcaatcgtg 120 gaaanaaaat taaatgttgt attaaataga tcaccagcta gtttcagagt taccatgtac 180 gtattccact agctgggttc tgtatttcag ttctttcgat acggcttagg gtaatgtcag 240 tacaggaaaa aaactgtgca agtgagcacc tgattccgtt gccttgctt 289 62 251 DNA Artificial Sequence misc_feature (10) (10) n is a, c, g, t, or u 62 ttagcttggn aaatttttct aaacacaatt gttatagcca gaggaacaaa gatgatataa 60 aatattgttg ctctgacaaa aatacatgta tttcattctc gtatggtgct agagttagat 120 taatctgcat tttaaaaaac tgaattggaa tagaattggt aagttgcaaa gactttttga 180 aaataattaa attatcatat cttccattcc tgttattgga gatgaaaata aaaagcaact 240 tatganagta g 251 63 252 DNA Artificial Sequence misc_feature (250) (250) n is a, c, g, t, or u 63 cttttttatg acaacttaga tcaactattt ttagcttggt aaatttttct aaacacaatt 60 gttatagcca gaggaacaaa gatgatataa aatattgttg ctctgacaaa aatacatgta 120 tttcattctc gtatggtgct agagttagat taatctgcat tttaaaaaac tgaattggaa 180 tagaattggt aagttgcaaa ggctttttga aaataattaa attatcatat cttccattcc 240 tgttattggn gg 252 64 245 DNA Artificial Sequence Description of Artificial Sequence Human EST 64 caaagttact aaaaaatacc acgaggtcct tcagttgaga ccaaagaccg gtgtcagggg 60 attgcacaaa tcactcaccg acgtggccct ggagcaccat gaggagtgtg actgtgtgtg 120 cagagggagc acaggaggat agccgcatca ccaccagcag ctcttgccca gagctgtgca 180 gtgcagtggc tgattctatt agagaacgta tgcgttatct ccatccttaa tctcagttgt 240 ttgct 245 65 245 DNA Artificial Sequence Description of Artificial Sequence Human EST 65 agataaacct gaaaagaaga gtggccttat cttcacttta tcgataagtc agtttatttg 60 tttcattgtg tacattttta tattctcctt ttgacattat aactgttggc ttttctaatc 120 ttgttaaata tatctatttt taccaaaggt atttaatatt cttttttatg acaacttaga 180 tcaactattt ttagcttggt aaatttttct aaacacaatt gttatagcca gaggaacaaa 240 gatga 245 66 243 DNA Artificial Sequence Description of Artificial Sequence Human EST 66 ctggattttt catatttctt attaaaattt ctgccattta gaagaagaga actacattca 60 tggtttggaa gagataaacc tgaaaagaag agtggcctta tcttcacttt atcgataagt 120 cagtttattt gtttcattgt gtacattttt atattctcct tttgacatta taactgttgg 180 cttttctaat cttgttaaat atatctattt ttaccaaagg tatttaatat tcttttttat 240 gac 243 67 244 DNA Artificial Sequence misc_feature (64) (64) n is a, c, g, t, or u 67 gctcatattc acatatgtaa accagaacat tctatgtact acaaacctgg tttttaaaaa 60 gganctatgt tgctatgaat taaacttgtg tcgtgctgat aggacagact ggatttttca 120 tatttcttat taaaatttct gccatttaga agaagagaac tacattcatg gtttggaaga 180 gataaacctg aaaagaagag tggccttatc ttcantttat cgataagtca gtttatttgt 240 ttca 244 68 247 DNA Artificial Sequence misc_feature (2) (2) n is a, c, g, t, or u 68 angagttgcc cagagctgtg cagtgcagtg gctgattcta ttagagaacg tatgcgttat 60 ctccatcctt aatctcagtt gtttgnttca aggacctttc atcttcagga tttacagtgc 120 attctgaaag aggagacatc aaacagaatt aggagttgtg caacagctct tttgagagga 180 ggcctaaagg ncaggagaaa aggtcttcaa tcgtggaaag aaaattaaat gttgtattaa 240 atagatc 247 69 233 DNA Artificial Sequence Description of Artificial Sequence Human EST 69 aaagatgata taaaatattg ttgctctgac aaaaatacat gtatttcatt ctcgtatggt 60 gctagagtta gattaatctg cattttaaaa aactgaattg gaatagaatt ggtaagttgc 120 aaagactttt tgaaaataat taaattatca tatcttccat tcctgttatt ggagatgaaa 180 ataaaaagca acttatgaaa gtagacattc agatccagcc attactaacc tat 233 70 232 DNA Artificial Sequence Description of Artificial Sequence Human EST 70 aggaaatcaa attaggataa gatttgtatc tgatgaatat tttccttctg aaccttctaa 60 cagaggaggt aagattatac agctgcacac ctcgtaactt ctcagtgtcc ataagggaag 120 aactaaagag aaccgatacc attttctggc caggttgtct cctggttaaa cgctgtggtg 180 ggaactgtgc ctgttgtctc cacaattgca atgaatgtca atgtgtccca ag 232 71 253 DNA Artificial Sequence Description of Artificial Sequence Human EST 71 gtgcattctg aaagaggaga catcaaacag aattaggagt tgtgcaacag ctcttttgag 60 aggaggccta aaggacagga gaaaaggtct tcaatcgtgg aaagaaaatt aaatgttgta 120 ttaaatagat caccagctag tttcagagtt accatgtacg tattccacta gctgggttct 180 gtatttcagt tctttcgata cggcttaggg taatgtcagt acaggaaaaa aactgtgcaa 240 gtgagcacct gat 253 72 233 DNA Artificial Sequence misc_feature (48) (48) n is a, c, g, t, or u 72 tgtacatttt tatattctcc ttttgacatt ataactgttg gcttttcnaa tcttgttaaa 60 tatatctatt tttaccaaag gtatttaata ttctttttta tgacaactta gatcaactat 120 ttttagcttg gtaaattttt ctaaacacaa ttgttatagc cagaggaaca aagatgatat 180 aaaatattgt tgctctgaca aaaatacatg tatttcattc tcgtatggtg cta 233 73 250 DNA Artificial Sequence misc_feature (53) (53) n is a, c, g, t, or u 73 cacaattgtt atagccagag gaacaaagat gatataaaat attgttgctc tgncaaaaat 60 acatgtattt cattctcgta tggtgctaga gttagattaa tctgcatttt aaaaaactga 120 attggaatag aattggtaag ttgcaaagac tttttgaaaa taattaaatt atcatatctt 180 ccattcctgt tattggagat gaaaataaaa agcaacttat gaaagtaaat tcagatccac 240 cattactaac 250 74 247 DNA Artificial Sequence Description of Artificial Sequence Human EST 74 atttcattct cgtatggtgc tagagttaga ttaatctgca ttttaaaaaa ctgaattgga 60 atagaattgg taagttgcaa agactttttg aaaataatta aattatcata tcttccattc 120 ctgttattgg agatgaaaat aaaaagcaac ttatgaaagt agacattcag atccagccat 180 tactaaccta ttcctttttt ggggaaatct gagcctagct cagaaaaaca taaagcacct 240 tgaaaaa 247 75 265 DNA Artificial Sequence misc_feature (238) (238) n is a, c, g, t, or u 75 tgcaacagct cttttgagag gaggcctaaa ggacaggaga aaaggtcttc aatcgtggaa 60 agaaaattaa atgttgtatt aaatagatca ccagctagtt tcagagttac catgtacgta 120 ttccactagc tgggttctgt atttcagttc tttcgatacg gcttagggta atgtcagtac 180 aggaaaaaaa ctgtgcaagt gagcacctga ttccgttgcc ttgcttaacc ctaaagcncc 240 atgtcnnggg cnaaaancga aaaat 265 76 251 DNA Artificial Sequence misc_feature (134) (134) n is a, c, g, t, or u 76 tttctaaaca caattgttat agccagagga acaaagatga tataaaatat tgttgctctg 60 acaaaaatac atgtatttca ttctcgtatg gtgctagagt tagattaatc tgcattttaa 120 aaaactgaat tggnatagaa ttggtaagtt gcaaagnctt tttgaaaata attaaattat 180 catatcttcc attcctgtta ttggaggatg gaaaataaaa agcaacttat ggaaagtagg 240 acattcagat c 251 77 291 DNA Artificial Sequence misc_feature (61) (61) n is a, c, g, t, or u 77 ccttaatctc agttgtttgc ttcaaggacc tttcatcttc aggatttaca gtgcattctg 60 naagangaga catcaaacag aattaggngt tgtgcaaaag ctcttttgag aggaggccta 120 aaggacagga gaaaaggtct ncaatcgtgg aaagnaaatt aaatgttgta tnaaatngat 180 caccagctag tttcagagtt accatgtacg tattccacta gctgggncng tattcagtct 240 ttcggaacgg cttagggtaa tgtcagtaca gganaaaaac tgtgcagtga g 291 78 253 DNA Artificial Sequence misc_feature (84) (84) n is a, c, g, t, or u 78 gtactacaaa cctggttttt aaaaaggaac tatgttgcta tgaattaaac ttgtgtccat 60 gctgatagga cagactggat tttncatatt tcttattaaa atttctgcca tttagaagaa 120 gagaactaca ttcatggttt ggnagagata aacctgaaaa gaagagtggc cttatcttca 180 ctttatcgat aagtcagttt atttgtttca tgtgtacatt tttatattct cctttgacat 240 ataacgtggc ttt 253 79 204 DNA Artificial Sequence misc_feature (190) (190) n is a, c, g, t, or u 79 ttatattctc cttttgacat tataactgtt ggcttttcta atcttgttaa atatatctat 60 ttttaccaaa ggtatttaat attctttttt atgacaactt agatcaacta tttttagctt 120 ggtaaatttt tctaaacaca attgttatag ccagaggaac aaagatgata taaaatattg 180 ttgctctgan aaaaatacat gtat 204 80 303 DNA Artificial Sequence misc_feature (2) (2) n is a, c, g, t, or u 80 anactgtgca agtgagcacc tgattccgtt gccttgctta actctaaagc tccatgtcct 60 gggcctaaaa tcgtataaaa tctggannnn nnnnnnnnnn nnnngctcat attcacatat 120 gtaaaccaga acattctatg tactacaaac ctggttttta aaaaggaact atgttgctat 180 gaattaaact tgtgtcgtgc tgataggaca gactggattt ttcatatttc ttattaaaat 240 ttctgccatt agaagaagag aactacnttc anggtttgga agagataacc ctgaaaagan 300 ggg 303 81 228 DNA Artificial Sequence misc_feature (112) (112) n is a, c, g, t, or u 81 gctcatattc acatatgtaa accagaacat tctatgtact acaaacctgg tttttaaaaa 60 ggaactattt gctatgaatt aaacttgtgt cgtgctgata ggacagactg gntttttcat 120 atttcttatt anaatttctg ccattagaag aagagaacta cattcatggt ttggaagaga 180 taaacctgaa aagaagagtg gcctatttca ctttatcgat aagtcagt 228 82 193 DNA Artificial Sequence Description of Artificial Sequence Human EST 82 gctcatattc acatatgtaa accagaacat tctatgtact acaaacctgg tttttaaaaa 60 ggaactatgt tgctatgaat taaacttgtg tcgtgctgat aggacagact ggatttttca 120 tatttcttat taaaatttct gccatttaga agaagagaac tacattcatg gtttggaaga 180 gataaacctg aaa 193 83 282 DNA Artificial Sequence misc_feature (42) (42) n is a, c, g, t, or u 83 aaaaaactga attggaatag aattggtaag ttgcaaagac tntttgaaaa taattaaatt 60 atcatatctt ccattcctgt tattggagat gaanataaaa agcaacttat gaaagtagac 120 attcagatcc agccattact aacctattcc ttttttgggg aaatctgagc ctagctcaga 180 aaaacataaa gcaccttgaa aaagacttgg cagcttcctg ataaagcgtg ctgtntgtca 240 gtaggaacac atcctattta ttgtgatgnt gtggtttatt at 282 84 279 DNA Artificial Sequence misc_feature (205) (205) n is a, c, g, t, or u 84 attaaataga tcaccagcta gtttcagagt taccatgtac gtattccact agctgggttc 60 tgtatttcag ttctttcgat acggcttagg gtaatgtcag tacaggaaaa aaactgtgca 120 agtgagcacc tgattccgtt gccttggctt aactctaaag ctccatgtcc tgggcctaaa 180 atcgtataaa atctggattt ttttnttttt ttttgcgcat attcacatat gtaaaccagn 240 acattctatg tacnacaaac ctggttttta aaaaggaac 279 85 181 DNA Artificial Sequence Description of Artificial Sequence Human EST 85 ggctagtttc agagttacca tgtacgtatt ccactagctg ggttctgtat ttcagttctt 60 tcgatacggc ttagggtaat gtcagtacag gaaaaaaact gtgcaagtga gcacctgatt 120 ccgttgcctt gcttaactct aaagctccat gtcctgggcc taaaatcgta taaaatctgg 180 a 181 86 269 DNA Artificial Sequence Description of Artificial Sequence Human EST 86 tggtaagttg caaagacttt ttgaaaataa ttaaattatc atatcttcca ttcctgttat 60 tggagatgaa aataaaaagc aacttatgaa agtagacatt cagatccagc cattactaac 120 ctattccttt tttggggaaa tctgagccta gctcagaaaa acataaagca ccttgaaaaa 180 gacttggcag cttcctgata aagcgtgctg tgctgtgcag tagggaacac atcctattta 240 ttgtgatgtt gtggtttata tcctaaacc 269 87 184 DNA Artificial Sequence misc_feature (54) (54) n is a, c, g, t, or u 87 aatagatcac cagctagttt cagagttacc atgtacgtat tccactagct gggntctgta 60 tttcagttcc tttcgatacg gcttagggta atgtcagtac aggaaaaaag ctgtgcaagt 120 gagcacctga ttccgttgcc ttgcttaact ctaaagctcc atgtcctggg cctaaaatcg 180 tata 184 88 164 DNA Artificial Sequence misc_feature (31) (31) n is a, c, g, t, or u 88 agataaacct gaaaagaaga gtggccttat nttcacttta tcgataagtc agnttatttg 60 tttcattgtg tacatttnna tattctcctt ttgacattat aactgntggc ttttctaanc 120 ntgttaaata tatctatttt taccaaaggt atttaatatt cttt 164 89 143 DNA Artificial Sequence Description of Artificial Sequence Human EST 89 tatggtgcta gagttagatt aatctgcatt ttaaaaaact gaattggaat agaattggta 60 agttgcaaag actttttgaa aataattaaa ttatcatatc ttccattcct gttattggag 120 atgaaaataa aaagcaactt atg 143 90 164 DNA Artificial Sequence misc_feature (7) (7) n is a, c, g, t, or u 90 ttttttnttt tgctcatatt cacatatgta aaccngaaca ttctatgtac nacaaacctg 60 gtttttaaaa aggaactatg ttgctatgaa ttaaacttgt gtcgtgctga taggacagac 120 tggatttttc anatttctta ntaanntttc tgccatttag aaga 164 91 244 DNA Artificial Sequence misc_feature (98) (115) n is a, c, g, t, or u 91 gtacaggaaa aaaactgtgc aagtgagcac ctgattccgt tgccttgctt aactctaaag 60 ctccatgtcc tgggcctaaa atcgtataaa atctggannn nnnnnnnnnn nnnnngctca 120 tattcacata tgtaaaccag aacattctat gtactacaaa cctggttttt aaaaaggaac 180 tatgttgcta tgaattaaac ttgtgtcgtg ctgataggac agactggatt tttcatattt 240 ctta 244 92 254 DNA Artificial Sequence misc_feature (16) (16) n is a, c, g, t, or u 92 gcaaagactt tttganaatn attaanttat catatcttcc attcctgtta tnggagatga 60 naataaaaag caacttatga aagtagacat tcagatccag ccattactaa cctattcctt 120 ttttggggaa atctgagcct agcncagaaa aacataaagc accttgaaaa agacttggca 180 gcttcctgat aaagcgtgct gtgctgtgca gtaggaacac atccnattta ttgtgntgtn 240 gnggttttat gatc 254 93 243 DNA Artificial Sequence misc_feature (103) (120) n is a, c, g, t, or u 93 tgtcagtaca ggaaaaaaac tgtgcaagtg agcacctgat tccgttgcct tgcttaactc 60 taaagctcca tgtcctgggc ctaaaatcgt ataaaatctg gannnnnnnn nnnnnnnnnn 120 gctcatattc acatatgtaa accagaacat tctatgtact acaaacctgg tttttaaaaa 180 ggaactatgt tgctatgaat taaacttgtg tcatgctgat aggacagact ggatttttca 240 tat 243 94 244 DNA Artificial Sequence misc_feature (36) (36) n is a, c, g, t, or u 94 aattatcata tcttccattc ctgttattgg agatgnaaat aaaaagcaac ttatgaaagt 60 agacattcag atccagccat tactaaccta ttcctttttt ggggaaatct gagcctagct 120 cagaaaaaca taaagcacct tgaaaaagac tgtcagcttc ctgataaagc gtgctgtgct 180 gtgcagtagg aacacatcct atttattgtg atgttgtggt tttattatct taaactcgtt 240 ccat 244 95 152 DNA Artificial Sequence misc_feature (2) (2) n is a, c, g, t, or u 95 anagatgata taaaanattg ttgctctgac aannatacat gtatttcatt ctcgtatggt 60 gctagagtta gattaatctg cnttttaaaa aactganttg gaatagantt ggtaagttgc 120 aaagncnttt gaaaatnatt aagttatcag at 152 96 292 DNA Artificial Sequence Description of Artificial Sequence Human EST 96 ttccattcct gttattggag atgaaaataa aaagcaactt atgaaagtag acattcagat 60 ccagccatta ctaacctatt ccttttttgg ggaaatctga gcctagctca gaaaaacata 120 aagcaccttg aaaaagactt ggcagcttcc tgataaagcg tgctgtgctg tgcagtagga 180 acacatccta tttattgtga tgttgtggtt ttattatcta aactctgttc catacacttg 240 tataaataca tggatatttt tatgtacaga agtatgtctc ttaaccagtt ca 292 97 308 DNA Artificial Sequence misc_feature (46) (46) n is a, c, g, t, or u 97 cttccattcc tgttattgga gatgaaaata aaaagcaact tatganagta gacattcaga 60 tccagccatt actaacctat tccttttttg gggaaatctg agcctagctc agaaaaacat 120 aaagcacctt gaaaaagact tggcagcttc ctgataaagc gtgctgtgct gtgcagtagg 180 aacacatcct atttattgtg atgttgtggt tttattatct taaactctgt tccatacact 240 tgtataaata catggatatt tttatgtaca gaagtatgtc tcttaaccag ttcacttatt 300 gtacctgg 308 98 1878 DNA Homo sapiens 98 aaaatgtatg gatacaactt acgtttgatg aaagatttgg gcttgaagac ccagaagatg 60 acatatgcaa gtatgatttt gtagaagttg aggaacccag tgatggaact atattagggc 120 gctggtgtgg ttctggtact gtaccaggaa aacagatttc taaaggaaat caaattagga 180 taagatttgt atctgatgaa tattttcctt ctgaaccttc taacagagga ggtaagatta 240 tacagctgca cacctcgtaa cttctcagtg tccataaggg aagaactaaa gagaaccgat 300 accattttct ggccaggttg tctcctggtt aaacgctgtg gtgggaactg tgcctgttgt 360 ctccacaatt gcaatgaatg tcaatgtgtc ccaagcaaag ttactaaaaa ataccacgag 420 gtccttcagt tgagaccaaa gaccggtgtc aggggattgc acaaatcact caccgacgtg 480 gccctggagc accatgagga gtgtgactgt gtgtgcagag ggagcacagg aggatagccg 540 catcaccacc agcagctctt gcccagagct gtgcagtgca gtggctgatt ctattagaga 600 acgtatgcgt tatctccatc cttaatctca gttgtttgct tcaaggacct ttcatcttca 660 ggatttacag tgcattctga aagaggagac atcaaacaga attaggagtt gtgcaacagc 720 tcttttgaga ggaggcctaa aggacaggag aaaaggtctt caatcgtgga aagaaaatta 780 aatgttgtat taaatagatc accagctagt ttcagagtta ccatgtacgt attccactag 840 ctgggttctg tatttcagtt ctttcgatac ggcttagggt aatgtcagta caggaaaaaa 900 actgtgcaag tgagcacctg attccgttgc cttgcttaac tctaaagctc catgtcctgg 960 gcctaaaatc gtataaaatc tggatttttt tttttttttt tgctcatatt cacatatgta 1020 aaccagaaca ttctatgtac tacaaacctg gtttttaaaa aggaactatg ttgctatgaa 1080 ttaaacttgt gtcgtgctga taggacagac tggatttttc atatttctta ttaaaatttc 1140 tgccatttag aagaagagaa ctacattcat ggtttggaag agataaacct gaaaagaaga 1200 gtggccttat cttcacttta tcgataagtc agtttatttg tttcattgtg tacattttta 1260 tattctcctt ttgacattat aactgttggc ttttctaatc ttgttaaata tatctatttt 1320 taccaaaggt atttaatatt cttttttatg acaacttaga tcaactattt ttagcttggt 1380 aaatttttct aaacacaatt gttatagcca gaggaacaaa gatgatataa aatattgttg 1440 ctctgacaaa aatacatgta tttcattctc gtatggtgct agagttagat taatctgcat 1500 tttaaaaaac tgaattggaa tagaattggt aagttgcaaa gactttttga aaataattaa 1560 attatcatat cttccattcc tgttattgga gatgaaaata aaaagcaact tatgaaagta 1620 gacattcaga tccagccatt actaacctat tccttttttg gggaaatctg agcctagctc 1680 agaaaaacat aaagcacctt gaaaaagact tggcagcttc ctgataaagc gtgctgtgct 1740 gtgcagtagg aacacatcct atttattgtg atgttgtggt tttattatct taaactctgt 1800 tccatacact tgtataaata catggatatt tttatgtaca gaagtatgtc tcttaaccag 1860 ttcacttatt gtacctgg 1878 99 113 PRT Homo sapiens 99 Met Asn Ile Phe Leu Leu Asn Leu Leu Thr Glu Glu Val Arg Leu Tyr 1 5 10 15 Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg Glu Glu Leu Lys 20 25 30 Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu Val Lys Arg Cys 35 40 45 Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn Glu Cys Gln Cys 50 55 60 Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val Leu Gln Leu Arg 65 70 75 80 Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu Thr Asp Val Ala 85 90 95 Leu Glu His His Glu Glu Cys Asp Cys Val Cys Arg Gly Ser Thr Gly 100 105 110 Gly 100 2475 DNA Homo sapiens 100 tgccagagca ggtgggcgct tccaccccag tgcagccttc ccctggcggt ggtgaaagag 60 actcgggagt cgctgcttcc aaagtgcccg ccgtgagtga gctctcaccc cagtcagcca 120 aatgagcctc ttcgggcttc tcctgctgac atctgccctg gccggccaga gacaggggac 180 tcaggcggaa tccaacctga gtagtaaatt ccagttttcc agcaacaagg aacagaacgg 240 agtacaagat cctcagcatg agagaattat tactgtgtct actaatggaa gtattcacag 300 cccaaggttt cctcatactt atccaagaaa tacggtcttg gtatggagat tagtagcagt 360 agaggaaaat gtatggatac aacttacgtt tgatgaaaga tttgggcttg aagacccaga 420 agatgacata tgcaagtatg attttgtaga agttgaggaa cccagtgatg gaactatatt 480 agggcgctgg tgtggttctg gtactgtacc aggaaaacag atttctaaag gaaatcaaat 540 taggataaga tttgtatctg atgaatattt tccttctgaa ccagggttct gcatccacta 600 caacattgtc atgccacaat tcacagaagc tgtgagtcct tcagtgctac ccccttcagc 660 tttgccactg gacctgctta ataatgctat aactgccttt agtaccttgg aagaccttat 720 tcgatatctt gaaccagaga gatggcagtt ggacttagaa gatctatata ggccaacttg 780 gcaacttctt ggcaaggctt ttgtttttgg aagaaaatcc agagtggtgg atctgaacct 840 tctaacagag gaggtaagat tatacagctg cacacctcgt aacttctcag tgtccataag 900 ggaagaacta aagagaaccg ataccatttt ctggccaggt tgtctcctgg ttaaacgctg 960 tggtgggaac tgtgcctgtt gtctccacaa ttgcaatgaa tgtcaatgtg tcccaagcaa 1020 agttactaaa aaataccacg aggtccttca gttgagacca aagaccggtg tcaggggatt 1080 gcacaaatca ctcaccgacg tggccctgga gcaccatgag gagtgtgact gtgtgtgcag 1140 agggagcaca ggaggatagc cgcatcacca ccagcagctc ttgcccagag ctgtgcagtg 1200 cagtggctga ttctattaga gaacgtatgc gttatctcca tccttaatct cagttgtttg 1260 cttcaaggac ctttcatctt caggatttac agtgcattct gaaagaggag acatcaaaca 1320 gaattaggag ttgtgcaaca gctcttttga gaggaggcct aaaggacagg agaaaaggtc 1380 ttcaatcgtg gaaagaaaat taaatgttgt attaaataga tcaccagcta gtttcagagt 1440 taccatgtac gtattccact agctgggttc tgtatttcag ttctttcgat acggcttagg 1500 gtaatgtcag tacaggaaaa aaactgtgca agtgagcacc tgattccgtt gccttgctta 1560 actctaaagc tccatgtcct gggcctaaaa tcgtataaaa tctggatttt tttttttttt 1620 tttgctcata ttcacatatg taaaccagaa cattctatgt actacaaacc tggtttttaa 1680 aaaggaacta tgttgctatg aattaaactt gtgtcgtgct gataggacag actggatttt 1740 tcatatttct tattaaaatt tctgccattt agaagaagag aactacattc atggtttgga 1800 agagataaac ctgaaaagaa gagtggcctt atcttcactt tatcgataag ccagtttatt 1860 tgtttcattg tgtacatttt tatattctcc ttttgacatt ataactgttg gcttttctaa 1920 tcttgttaaa tatatctatt tttaccaaag gtatttaata ttctttttta tgacaactta 1980 gatcaactat ttttagcttg gtaaattttt ctaaacacaa ttgttatagc cagaggaaca 2040 aagatgatat aaaatattgt tgctctgaca aaaatacatg tatttcattc tcgtatggtg 2100 ctagagttag attaatctgc attttaaaaa actgaattgg aatagaattg gtaagttgca 2160 aagacttttt gaaaataatt aaattatcat atcttccatt cctgttattg gagatgaaaa 2220 taaaaagcaa cttatgaaag tagacattca gatccagcca ttactaacct attccttttt 2280 tggggaaatc tgagcctagc tcagaaaaac ataaagcacc ttgaaaaaga cttggcagct 2340 tcctgataaa gcgtgctgtg ctgtgcagta ggaacacatc ctatttattg tgatgttgtg 2400 gttttattat cttaaactct gttccataca cttgtataaa tacatggata tttttatgta 2460 cagaagtatg tctct 2475 101 345 PRT Homo sapiens 101 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser 245 250 255 Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro 260 265 270 Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu 275 280 285 His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys 290 295 300 Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu 305 310 315 320 His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp 325 330 335 Cys Val Cys Arg Gly Ser Thr Gly Gly 340 345 102 2776 DNA Homo sapiens 102 atttgtttaa accttgggaa actggttcag gtccaggttt tgctttgatc cttttcaaaa 60 actggagaca cagaagaggg cttctaggaa aaagttttgg gatgggatta tgtggaaact 120 accctgcgat tctctgctgc cagagcaggc tcggcgcttc caccccagtg cagccttccc 180 ctggcggtgg tgaaagagac tcgggagtcg ctgcttccaa agtgcccgcc gtgagtgagc 240 tctcacccca gtcagccaaa tgagcctctt cgggcttctc ctgctgacat ctgccctggc 300 cggccagaga caggggactc aggcggaatc caacctgagt agtaaattcc agttttccag 360 caacaaggaa cagtacggag tacaagatcc tcagcatgag agaattatta ctgtgtctac 420 taatggaagt attcacagcc caaggtttcc tcatacttat ccaagaaata cggtcttggt 480 atggagatta gtagcagtag aggaaaatgt atggatacaa cttacgtttg atgaaagatt 540 tgggcttgaa gacccagaag atgacatatg caagtatgat tttgtagaag ttgaggaacc 600 cagtgatgga actatattag ggcgctggtg tggttctggt actgtaccag gaaaacagat 660 ttctaaagga aatcaaatta ggataagatt tgtatctgat gaatattttc cttctgaacc 720 agggttctgc atccactaca acattgtcat gccacaattc acagaagctg tgagtccttc 780 agtgctaccc ccttcagctt tgccactgga cctgcttaat aatgctataa ctgcctttag 840 taccttggaa gaccttattc gatatcttga accagagaga tggcagttgg acttagaaga 900 tctatatagg ccaacttggc aacttcttgg caaggctttt gtttttggaa gaaaatccag 960 agtggtggat ctgaaccttc taacagagga ggtaagatta tacagctgca cacctcgtaa 1020 cttctcagtg tccataaggg aagaactaaa gagaaccgat accattttct ggccaggttg 1080 tctcctggtt aaacgctgtg gtgggaactg tgcctgttgt ctccacaatt gcaatgaatg 1140 tcaatgtgtc ccaagcaaag ttactaaaaa ataccacgag gtccttcagt tgagaccaaa 1200 gaccggtgtc aggggattgc acaaatcact caccgacgtg gccctggagc accatgagga 1260 gtgtgactgt gtgtgcagag ggagcacagg aggatagccg catcaccacc agcagctctt 1320 gcccagagct gtgcagtgca gtggctgatt ctattagaga acgtatgcgt tatctccatc 1380 cttaatctca gttgtttgct tcaaggacct ttcatcttca ggatttacag tgcattctga 1440 aagaggagac atcaaacaga attaggagtt gtgcaacagc tcttttgaga ggaggcctaa 1500 aggacaggag aaaaggtctt caatcgtgga aagaaaatta aatgttgtat taaatagatc 1560 accagctagt ttcagagtta ccatgtacgt attccactag ctgggttctg tatttcagtt 1620 ctttcgatac ggcttagggt aatgtcagta caggaaaaaa actgtgcaag tgagcacctg 1680 attccgttgc cttggcttaa ctctaaagct ccatgtcctg ggcctaaaat cgtataaaat 1740 ctggattttt tttttttttt ttgcgcatat tcacatatgt aaaccagaac attctatgta 1800 ctacaaacct ggtttttaaa aaggaactat gttgctatga attaaacttg tgtcatgctg 1860 ataggacaga ctggattttt catatttctt attaaaattt ctgccattta gaagaagaga 1920 actacattca tggtttggaa gagataaacc tgaaaagaag agtggcctta tcttcacttt 1980 atcgataagt cagtttattt gtttcattgt gtacattttt atattctcct tttgacatta 2040 taactgttgg cttttctaat cttgttaaat atatctattt ttaccaaagg tatttaatat 2100 tcttttttat gacaacttag atcaactatt tttagcttgg taaatttttc taaacacaat 2160 tgttatagcc agaggaacaa agatgatata aaatattgtt gctctgacaa aaatacatgt 2220 atttcattct cgtatggtgc tagagttaga ttaatctgca ttttaaaaaa ctgaattgga 2280 atagaattgg taagttgcaa agactttttg aaaataatta aattatcata tcttccattc 2340 ctgttattgg agatgaaaat aaaaagcaac ttatgaaagt agacattcag atccagccat 2400 tactaaccta ttcctttttt ggggaaatct gagcctagct cagaaaaaca taaagcacct 2460 tgaaaaagac ttggcagctt cctgataaag cgtgctgtgc tgtgcagtag gaacacatcc 2520 tatttattgt gatgttgtgg ttttattatc ttaaactctg ttccatacac ttgtataaat 2580 acatggatat ttttatgtac agaagtatgt ctcttaacca gttcacttat tgtactctgg 2640 caatttaaaa gaaaatcagt aaaatatttt gcttgtaaaa tgcttaatat cgtgcctagg 2700 ttatgtggtg actatttgaa tcaaaaatgt attgaatcat caaataaaag aatgtggcta 2760 ttttggggag aaaatt 2776 103 345 PRT Homo sapiens 103 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Tyr Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser 245 250 255 Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro 260 265 270 Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu 275 280 285 His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys 290 295 300 Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu 305 310 315 320 His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp 325 330 335 Cys Val Cys Arg Gly Ser Thr Gly Gly 340 345 104 1473 DNA Homo sapiens 104 tttgtttaaa ccttgggaaa ctggttcagg tccaggtttt gctttgatcc ttttcaaaaa 60 ctggagacac agaagagggc tctaggaaaa agttttggat gggattatgt ggaaactacc 120 ctgcgattct ctgctgccag agcaggctcg gcgcttccac cccagtgcag ccttcccctg 180 gcggtggtga aagagactcg ggagtcgctg cttccaaagt gcccgccgtg agtgagctct 240 caccccagtc agccaaatga gcctcttcgg gcttctcctg ctgacatctg ccctggccgg 300 ccagagacag gggactcagg cggaatccaa cctgagtagt aaattccagt tttccagcaa 360 caaggaacag aacggagtac aagatcctca gcatgagaga attattactg tgtctactaa 420 tggaagtatt cacagcccaa ggtttcctca tacttatcca agaaatacgg tcttggtatg 480 gagattagta gcagtagagg aaaatgtatg gatacaactt acgtttgatg aaagatttgg 540 gcttgaagac ccagaagatg acatatgcaa gtatgatttt gtagaagttg aggaacccag 600 tgatggaact atattagggc gctggtgtgg ttctggtact gtaccaggaa aacagatttc 660 taaaggaaat caaattagga taagatttgt atctgatgaa tattttcctt ctgaaccagg 720 gttctgcatc cactacaaca ttgtcatgcc acaattcaca gaagctgtga gtccttcagt 780 gctaccccct tcagctttgc cactggacct gcttaataat gctataactg cctttagtac 840 cttggaagac cttattcgat atcttgaacc agagagatgg cagttggact tagaagatct 900 atataggcca acttggcaac ttcttggcaa ggcttttgtt tttggaagaa aatccagagt 960 ggtggatctg aaccttctaa cagaggaggt aagattatac agctgcacac ctcgtaactt 1020 ctcagtgtcc ataagggaag aactaaagag aaccgatacc attttctggc caggttgtct 1080 cctggttaaa cgctgtggtg ggaactgtgc ctgttgtctc cacaattgca atgaatgtca 1140 atgtgtccca agcaaagtta ctaaaaaata ccacgaggtc cttcagttga gaccaaagac 1200 cggtgtcagg ggattgcaca aatcactcac cgacgtggcc ctggagcacc atgaggagtg 1260 tgactgtgtg tgcagaggga gcacaggagg atagccgcat caccaccagc agctcttgcc 1320 cagagctgtg cagtgcagtg gctgattcta ttagagaacg tatgcgttat ctccatcctt 1380 aatctcagtt gtttgcttca aggacctttc atcttcagga tttacagtgc attctgaaag 1440 aggagacatc aaacagaatt aggagttgtg caa 1473 105 215 PRT Homo sapiens 105 Met Asn Phe Leu Leu Ser Trp Val His Trp Ser Leu Ala Leu Leu Leu 1 5 10 15 Tyr Leu His His Ala Lys Trp Ser Gln Ala Ala Pro Met Ala Glu Gly 20 25 30 Gly Gly Gln Asn His His Glu Val Val Lys Phe Met Asp Val Tyr Gln 35 40 45 Arg Ser Tyr Cys His Pro Ile Glu Thr Leu Val Asp Ile Phe Gln Glu 50 55 60 Tyr Pro Asp Glu Ile Glu Tyr Ile Phe Lys Pro Ser Cys Val Pro Leu 65 70 75 80 Met Arg Cys Gly Gly Cys Cys Asn Asp Glu Gly Leu Glu Cys Val Pro 85 90 95 Thr Glu Glu Ser Asn Ile Thr Met Gln Ile Met Arg Ile Lys Pro His 100 105 110 Gln Gly Gln His Ile Gly Glu Met Ser Phe Leu Gln His Asn Lys Cys 115 120 125 Glu Cys Arg Pro Lys Lys Asp Arg Ala Arg Gln Glu Lys Lys Ser Val 130 135 140 Arg Gly Lys Gly Lys Gly Gln Lys Arg Lys Arg Lys Lys Ser Arg Tyr 145 150 155 160 Lys Ser Trp Ser Val Pro Cys Gly Pro Cys Ser Glu Arg Arg Lys His 165 170 175 Leu Phe Val Gln Asp Pro Gln Thr Cys Lys Cys Ser Cys Lys Asn Thr 180 185 190 Asp Ser Arg Cys Lys Ala Arg Gln Leu Glu Leu Asn Glu Arg Thr Cys 195 200 205 Arg Cys Asp Lys Pro Arg Arg 210 215 106 149 PRT Homo sapiens 106 Met Pro Val Met Arg Leu Phe Pro Cys Phe Leu Gln Leu Leu Ala Gly 1 5 10 15 Leu Ala Leu Pro Ala Val Pro Pro Gln Gln Trp Ala Leu Ser Ala Gly 20 25 30 Asn Gly Ser Ser Glu Val Glu Val Val Pro Phe Gln Glu Val Trp Gly 35 40 45 Arg Ser Tyr Cys Arg Ala Leu Glu Arg Leu Val Asp Val Val Ser Glu 50 55 60 Tyr Pro Ser Glu Val Glu His Met Phe Ser Pro Ser Cys Val Ser Leu 65 70 75 80 Leu Arg Cys Thr Gly Cys Cys Gly Asp Glu Asn Leu His Cys Val Pro 85 90 95 Val Glu Thr Ala Asn Val Thr Met Gln Leu Leu Lys Ile Arg Ser Gly 100 105 110 Asp Arg Pro Ser Tyr Val Glu Leu Thr Phe Ser Gln His Val Arg Cys 115 120 125 Glu Cys Arg Pro Leu Arg Glu Lys Met Lys Pro Glu Arg Cys Gly Asp 130 135 140 Ala Val Pro Arg Arg 145 107 188 PRT Homo sapiens 107 Met Ser Pro Leu Leu Arg Arg Leu Leu Leu Ala Ala Leu Leu Gln Leu 1 5 10 15 Ala Pro Ala Gln Ala Pro Val Ser Gln Pro Asp Ala Pro Gly His Gln 20 25 30 Arg Lys Val Val Ser Trp Ile Asp Val Tyr Thr Arg Ala Thr Cys Gln 35 40 45 Pro Arg Glu Val Val Val Pro Leu Thr Val Glu Leu Met Gly Thr Val 50 55 60 Ala Lys Gln Leu Val Pro Ser Cys Val Thr Val Gln Arg Cys Gly Gly 65 70 75 80 Cys Cys Pro Asp Asp Gly Leu Glu Cys Val Pro Thr Gly Gln His Gln 85 90 95 Val Arg Met Gln Ile Leu Met Ile Arg Tyr Pro Ser Ser Gln Leu Gly 100 105 110 Glu Met Ser Leu Glu Glu His Ser Gln Cys Glu Cys Arg Pro Lys Lys 115 120 125 Lys Asp Ser Ala Val Lys Pro Asp Ser Pro Arg Pro Leu Cys Pro Arg 130 135 140 Cys Thr Gln His His Gln Arg Pro Asp Pro Arg Thr Cys Arg Cys Arg 145 150 155 160 Cys Arg Arg Arg Ser Phe Leu Arg Cys Gln Gly Arg Gly Leu Glu Leu 165 170 175 Asn Pro Asp Thr Cys Arg Cys Arg Lys Leu Arg Arg 180 185 108 419 PRT Homo sapiens 108 Met His Leu Leu Gly Phe Phe Ser Val Ala Cys Ser Leu Leu Ala Ala 1 5 10 15 Ala Leu Leu Pro Gly Pro Arg Glu Ala Pro Ala Ala Ala Ala Ala Phe 20 25 30 Glu Ser Gly Leu Asp Leu Ser Asp Ala Glu Pro Asp Ala Gly Glu Ala 35 40 45 Thr Ala Tyr Ala Ser Lys Asp Leu Glu Glu Gln Leu Arg Ser Val Ser 50 55 60 Ser Val Asp Glu Leu Met Thr Val Leu Tyr Pro Glu Tyr Trp Lys Met 65 70 75 80 Tyr Lys Cys Gln Leu Arg Lys Gly Gly Trp Gln His Asn Arg Glu Gln 85 90 95 Ala Asn Leu Asn Ser Arg Thr Glu Glu Thr Ile Lys Phe Ala Ala Ala 100 105 110 His Tyr Asn Thr Glu Ile Leu Lys Ser Ile Asp Asn Glu Trp Arg Lys 115 120 125 Thr Gln Cys Met Pro Arg Glu Val Cys Ile Asp Val Gly Lys Glu Phe 130 135 140 Gly Val Ala Thr Asn Thr Phe Phe Lys Pro Pro Cys Val Ser Val Tyr 145 150 155 160 Arg Cys Gly Gly Cys Cys Asn Ser Glu Gly Leu Gln Cys Met Asn Thr 165 170 175 Ser Thr Ser Tyr Leu Ser Lys Thr Leu Phe Glu Ile Thr Val Pro Leu 180 185 190 Ser Gln Gly Pro Lys Pro Val Thr Ile Ser Phe Ala Asn His Thr Ser 195 200 205 Cys Arg Cys Met Ser Lys Leu Asp Val Tyr Arg Gln Val His Ser Ile 210 215 220 Ile Arg Arg Ser Leu Pro Ala Thr Leu Pro Gln Cys Gln Ala Ala Asn 225 230 235 240 Lys Thr Cys Pro Thr Asn Tyr Met Trp Asn Asn His Ile Cys Arg Cys 245 250 255 Leu Ala Gln Glu Asp Phe Met Phe Ser Ser Asp Ala Gly Asp Asp Ser 260 265 270 Thr Asp Gly Phe His Asp Ile Cys Gly Pro Asn Lys Glu Leu Asp Glu 275 280 285 Glu Thr Cys Gln Cys Val Cys Arg Ala Gly Leu Arg Pro Ala Ser Cys 290 295 300 Gly Pro His Lys Glu Leu Asp Arg Asn Ser Cys Gln Cys Val Cys Lys 305 310 315 320 Asn Lys Leu Phe Pro Ser Gln Cys Gly Ala Asn Arg Glu Phe Asp Glu 325 330 335 Asn Thr Cys Gln Cys Val Cys Lys Arg Thr Cys Pro Arg Asn Gln Pro 340 345 350 Leu Asn Pro Gly Lys Cys Ala Cys Glu Cys Thr Glu Ser Pro Gln Lys 355 360 365 Cys Leu Leu Lys Gly Lys Lys Phe His His Gln Thr Cys Ser Cys Tyr 370 375 380 Arg Arg Pro Cys Thr Asn Arg Gln Lys Ala Cys Glu Pro Gly Phe Ser 385 390 395 400 Tyr Ser Glu Glu Val Cys Arg Cys Val Pro Ser Tyr Trp Lys Arg Pro 405 410 415 Gln Met Ser 109 354 PRT Homo sapiens 109 Met Tyr Arg Glu Trp Val Val Val Asn Val Phe Met Met Leu Tyr Val 1 5 10 15 Gln Leu Val Gln Gly Ser Ser Asn Glu His Gly Pro Val Lys Arg Ser 20 25 30 Ser Gln Ser Thr Leu Glu Arg Ser Glu Gln Gln Ile Arg Ala Ala Ser 35 40 45 Ser Leu Glu Glu Leu Leu Arg Ile Thr His Ser Glu Asp Trp Lys Leu 50 55 60 Trp Arg Cys Arg Leu Arg Leu Lys Ser Phe Thr Ser Met Asp Ser Arg 65 70 75 80 Ser Ala Ser His Arg Ser Thr Arg Phe Ala Ala Thr Phe Tyr Asp Ile 85 90 95 Glu Thr Leu Lys Val Ile Asp Glu Glu Trp Gln Arg Thr Gln Cys Ser 100 105 110 Pro Arg Glu Thr Cys Val Glu Val Ala Ser Glu Leu Gly Lys Ser Thr 115 120 125 Asn Thr Phe Phe Lys Pro Pro Cys Val Asn Val Phe Arg Cys Gly Gly 130 135 140 Cys Cys Asn Glu Glu Ser Leu Ile Cys Met Asn Thr Ser Thr Ser Tyr 145 150 155 160 Ile Ser Lys Gln Leu Phe Glu Ile Ser Val Pro Leu Thr Ser Val Pro 165 170 175 Glu Leu Val Pro Val Lys Val Ala Asn His Thr Gly Cys Lys Cys Leu 180 185 190 Pro Thr Ala Pro Arg His Pro Tyr Ser Ile Ile Arg Arg Ser Ile Gln 195 200 205 Ile Pro Glu Glu Asp Arg Cys Ser His Ser Lys Lys Leu Cys Pro Ile 210 215 220 Asp Met Leu Trp Asp Ser Asn Lys Cys Lys Cys Val Leu Gln Glu Glu 225 230 235 240 Asn Pro Leu Ala Gly Thr Glu Asp His Ser His Leu Gln Glu Pro Ala 245 250 255 Leu Cys Gly Pro His Met Met Phe Asp Glu Asp Arg Cys Glu Cys Val 260 265 270 Cys Lys Thr Pro Cys Pro Lys Asp Leu Ile Gln His Pro Lys Asn Cys 275 280 285 Ser Cys Phe Glu Cys Lys Glu Ser Leu Glu Thr Cys Cys Gln Lys His 290 295 300 Lys Leu Phe His Pro Asp Thr Cys Ser Cys Glu Asp Arg Cys Pro Phe 305 310 315 320 His Thr Arg Pro Cys Ala Ser Gly Lys Thr Ala Cys Ala Lys His Cys 325 330 335 Arg Phe Pro Lys Glu Lys Arg Ala Ala Gln Gly Pro His Ser Arg Lys 340 345 350 Asn Pro 110 345 PRT Homo sapiens 110 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser 245 250 255 Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro 260 265 270 Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu 275 280 285 His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys 290 295 300 Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu 305 310 315 320 His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp 325 330 335 Cys Val Cys Arg Gly Ser Thr Gly Gly 340 345 111 167 PRT Homo sapiens 111 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Ser Asn Arg Gly Gly Lys 145 150 155 160 Ile Ile Gln Leu His Thr Ser 165 112 282 PRT Homo sapiens 112 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly 245 250 255 Leu His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys 260 265 270 Asp Cys Val Cys Arg Gly Ser Thr Gly Gly 275 280 113 822 DNA Homo sapiens 113 aggaaatcaa attaggataa gatttgtatc tgatgaatat tttccttctg aaccttctaa 60 cagaggaggt aagattatac agctgcacac ctcgtaactt ctcagtgtcc ataagggaag 120 aactaaagag aaccgatacc attttctggc caggttgtct cctggttaaa cgctgtggtg 180 ggaactgtgc ctgttgtctc cacaattgca atgaatgtca atgtgtccca agcaaagtta 240 ctaaaaaata ccacgaggtc cttcagttga gaccaaagac cggtgtcagg ggattgcaca 300 aatcactcac cgacgtggcc ctggagcacc atgaggagtg tgactgtgtg tgcagaggga 360 gcacaggagg atagccgcat caccaccagc agctcttgcc cagagctgtg cagtgcagtg 420 gctgattcta ttagagaacg tatgcgttat ctccatcctt aatctcagtt gtttgcttca 480 aggacctttc atcttcagga tttacagtgc attctgaaag aggagacatc aaacagaatt 540 aggagttgtg caacagctct tttgagagga ggcctaaagg acaggagaaa aggtcttcaa 600 tcgtggaaag aaaattaaat gttgtattaa atagatcacc agctagtttc agagttacca 660 tgtacgtatt ccactagctg ggttctgtat ttcagttctt tcgatacggc ttagggtaat 720 gtcagtacag gaaaaaaact gtgcaagtga gcacctgatt ccgttgcctt ggcttaactc 780 taaagctcca tgtcctgggc ctaaaatcgt ataaaatctg ga 822 114 227 PRT Homo sapiens 114 Met Asn Ile Phe Leu Leu Asn Leu Leu Thr Glu Glu Val Arg Leu Tyr 1 5 10 15 Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg Glu Glu Leu Lys 20 25 30 Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu Val Lys Arg Cys 35 40 45 Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn Glu Cys Gln Cys 50 55 60 Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val Leu Gln Leu Arg 65 70 75 80 Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu Thr Asp Val Ala 85 90 95 Val Ser Gly Asp Cys Thr Asn His Ser Pro Thr Trp Pro Leu Glu His 100 105 110 His Glu Glu Cys Asp Cys Val Cys Arg Gly Ser Thr Gly Gly Val Gln 115 120 125 Arg Glu His Arg Arg Ile Ala Ala Ser Pro Pro Ala Ala Leu Ala Trp 130 135 140 Ser Thr Met Arg Ser Val Thr Val Cys Ala Glu Gly Ala Gln Glu Asp 145 150 155 160 Ser Arg Ile Thr Thr Ser Ser Ser Cys Gln Ser Cys Ala Val Gln Trp 165 170 175 Leu Ile Leu Leu Glu Asn Val Cys Val Ile Ser Ile Leu Asn Leu Ser 180 185 190 Cys Leu Leu Gln Pro Glu Leu Cys Ser Ala Val Ala Asp Ser Ile Arg 195 200 205 Glu Arg Met Arg Tyr Leu His Pro Gly Pro Phe Ile Phe Arg Ile Tyr 210 215 220 Ser Ala Phe 225 115 1716 DNA Homo sapiens misc_feature (830)..(830) n is a, c, g, or t 115 aggaaatcaa attaggataa gatttgtatc tgatgaatat tttccttctg aaccttctaa 60 cagaggaggt aagattatac agctgcacac ctcgtaactt ctcagtgtcc ataagggaag 120 aactaaagag aaccgatacc attttctggc caggttgtct cctggttaaa cgctgtggtg 180 ggaactgtgc ctgttgtctc cacaattgca atgaatgtca atgtgtccca agcaaagtta 240 ctaaaaaata ccacgaggtc cttcagttga gaccaaagac cggtgtcagg ggattgcaca 300 aatcactcac cgacgtggcc ctggagcacc atgaggagtg tgactgtgtg tgcagaggga 360 gcacaggagg atagccgcat caccaccagc agctcttgcc cagagctgtg cagtgcagtg 420 gctgattcta ttagagaacg tatgcgttat ctccatcctt aatctcagtt gtttgcttca 480 aggacctttc atcttcagga tttacagtgc attctgaaag aggagacatc aaacagaatt 540 aggagttgtg caacagctct tttgagagga ggcctaaagg acaggagaaa aggtcttcaa 600 tcgtggaaag aaaattaaat gttgtattaa atagatcacc agctagtttc agagttacca 660 tgtacgtatt ccactagctg ggttctgtat ttcagttctt tcgatacggc ttagggtaat 720 gtcagtacag gaaaaaaact gtgcaagtga gcacctgatt ccgttgcctt ggcttaactc 780 taaagctcca tgtcctgggc ctaaaatcgt ataaaatctg gatttttttn tttttttttg 840 cgcatattca catatgtaaa ccagaacatt ctatgtacta caaacctggt ttttaaaaag 900 gaactatgtt gctatgaatt aaacttgtgt cgtgctgata ggacagactg gatttttcat 960 atttcttatt aaaatttctg ccatttagaa gaagagaact acattcatgg tttggaagag 1020 ataaacctga aaagaagagt ggccttatct tcactttatc gataagtcag tttatttgtt 1080 tcattgtgta catttttata ttctcctttt gacattataa ctgttggctt ttctaatctt 1140 gttaaatata tctattttta ccaaaggtat ttaatattct tttttatgac aacttagatc 1200 aactattttt agcttggtaa atttttctaa acacaattgt tatagccaga ggaacaaaga 1260 tgatataaaa tattgttgct ctgacaaaaa tacatgtatt tcattctcgt atggtgctag 1320 agttagatta atctgcattt taaaaaactg aattggaata gaattggtaa gttgcaaaga 1380 ctttttgaaa ataattaaat tatcatatct tccattcctg ttattggaga tgaaaataaa 1440 aagcaactta tgaaagtaga cattcagatc cagccattac taacctattc cttttttggg 1500 gaaatctgag cctagctcag aaaaacataa agcaccttga aaaagacttg gcagcttcct 1560 gataaagcgt gctgtgctgt gcagtaggaa cacatcctat ttattgtgat gttgtggttt 1620 tattatctta aactctgttc catacacttg tataaataca tggatatttt tatgtacaga 1680 agtatgtctc ttaaccagtt cacttattgt acctgg 1716 116 227 PRT Homo sapiens 116 Met Asn Ile Phe Leu Leu Asn Leu Leu Thr Glu Glu Val Arg Leu Tyr 1 5 10 15 Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg Glu Glu Leu Lys 20 25 30 Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu Val Lys Arg Cys 35 40 45 Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn Glu Cys Gln Cys 50 55 60 Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val Leu Gln Leu Arg 65 70 75 80 Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu Thr Asp Val Ala 85 90 95 Val Ser Gly Asp Cys Thr Asn His Ser Pro Thr Trp Pro Leu Glu His 100 105 110 His Glu Glu Cys Asp Cys Val Cys Arg Gly Ser Thr Gly Gly Val Gln 115 120 125 Arg Glu His Arg Arg Ile Ala Ala Ser Pro Pro Ala Ala Leu Ala Trp 130 135 140 Ser Thr Met Arg Ser Val Thr Val Cys Ala Glu Gly Ala Gln Glu Asp 145 150 155 160 Ser Arg Ile Thr Thr Ser Ser Ser Cys Gln Ser Cys Ala Val Gln Trp 165 170 175 Leu Ile Leu Leu Glu Asn Val Cys Val Ile Ser Ile Leu Asn Leu Ser 180 185 190 Cys Leu Leu Gln Pro Glu Leu Cys Ser Ala Val Ala Asp Ser Ile Arg 195 200 205 Glu Arg Met Arg Tyr Leu His Pro Gly Pro Phe Ile Phe Arg Ile Tyr 210 215 220 Ser Ala Phe 225 117 1134 DNA Homo sapiens 117 ggatccaaaa tgagcctctt cgggcttctc ctgctgacat ctgccctggc cggccagaga 60 caggggactc aggcggaatc caacctgagt agtaaattcc agttttccag caacaaggaa 120 cagaacggag tacaagatcc tcagcatgag agaattatta ctgtgtctac taatggaagt 180 attcacagcc caaggtttcc tcatacttat ccaagaaata cggtcttggt atggagatta 240 gtagcagtag aggaaaatgt atggatacaa cttacgtttg atgaaagatt tgggcttgaa 300 gacccagaag atgacatatg caagtatgat tttgtagaag ttgaggaacc cagtgatgga 360 actatattag ggcgctggtg tggttctggt actgtaccag gaaaacagat ttctaaagga 420 aatcaaatta ggataagatt tgtatctgat gaatattttc cttctgaacc agggttctgc 480 atccactaca acattgtcat gccacaattc acagaagctg tgagtccttc agtgctaccc 540 ccttcagctt tgccactgga cctgcttaat aatgctataa ctgcctttag taccttggaa 600 gaccttattc gatatcttga accagagaga tggcagttgg acttagaaga tctatatagg 660 ccaacttggc aacttcttgg caaggctttt gtttttggaa gaaaatccag agtggtggat 720 ctgaaccttc taacagagga ggtaagatta tacagctgca cacctcgtaa cttctcagtg 780 tccataaggg aagaactaaa gagaaccgat accattttct ggccaggttg tctcctggtt 840 aaacgctgtg gtgggaactg tgcctgttgt ctccacaatt gcaatgaatg tcaatgtgtc 900 ccaagcaaag ttactaaaaa ataccacgag gtccttcagt tgagaccaaa gaccggtgtc 960 aggggattgc acaaatcact caccgacgtg gccctggagc accatgagga gtgtgactgt 1020 gtgtgcagag ggagcacagg aggatctaga gggcccttcg aaggtaagcc tatccctaac 1080 cctctcctcg gtctcgattc tacgcgtacc ggtcatcatc accatcacca ttga 1134 118 374 PRT Homo sapiens 118 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser 245 250 255 Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro 260 265 270 Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu 275 280 285 His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys 290 295 300 Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu 305 310 315 320 His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp 325 330 335 Cys Val Cys Arg Gly Ser Thr Gly Gly Ser Arg Gly Pro Phe Glu Gly 340 345 350 Lys Pro Ile Pro Asn Pro Leu Leu Gly Leu Asp Ser Thr Arg Thr Gly 355 360 365 His His His His His His 370 119 1134 DNA Homo sapiens 119 gaattcaaag gcctgtattt tactgttttc gtaacagttt tgtaataaaa aaacctataa 60 atatgaaatt cttagtcaac gttgcccttg tttttatggt cgtatacatt tcttacatct 120 atgcggatcc ggagtctcac catcaccacc atcatgaatc caacctgagt agtaaattcc 180 agttttccag caacaaggaa cagaacggag tacaagatcc tcagcatgag agaattatta 240 ctgtgtctac taatggaagt attcacagcc caaggtttcc tcatacttat ccaagaaata 300 cggtcttggt atggagatta gtagcagtag aggaaaatgt atggatacaa cttacgtttg 360 atgaaagatt tgggcttgaa gacccagaag atgacatatg caagtatgat tttgtagaag 420 ttgaggaacc cagtgatgga actatattag ggcgctggtg tggttctggt actgtaccag 480 gaaaacagat ttctaaagga aatcaaatta ggataagatt tgtatctgat gaatattttc 540 cttctgaacc agggttctgc atccactaca acattgtcat gccacaattc acagaagctg 600 tgagtccttc agtgctaccc ccttcagctt tgccactgga cctgcttaat aatgctataa 660 ctgcctttag taccttggaa gaccttattc gatatcttga accagagaga tggcagttgg 720 acttagaaga tctatatagg ccaacttggc aacttcttgg caaggctttt gtttttggaa 780 gaaaatccag agtggtggat ctgaaccttc taacagagga ggtaagatta tacagctgca 840 cacctcgtaa cttctcagtg tccataaggg aagaactaaa gagaaccgat accattttct 900 ggccaggttg tctcctggtt aaacgctgtg gtgggaactg tgcctgttgt ctccacaatt 960 gcaatgaatg tcaatgtgtc ccaagcaaag ttactaaaaa ataccacgag gtccttcagt 1020 tgagaccaaa gaccggtgtc aggggattgc acaaatcact caccgacgtg gccctggagc 1080 accatgagga gtgtgactgt gtgtgcagag ggagcacagg aggatagctc taga 1134 120 354 PRT Homo sapiens 120 Met Lys Phe Leu Val Asn Val Ala Leu Val Phe Met Val Val Tyr Ile 1 5 10 15 Ser Tyr Ile Tyr Ala Asp Pro Glu Ser His His His His His His Glu 20 25 30 Ser Asn Leu Ser Ser Lys Phe Gln Phe Ser Ser Asn Lys Glu Gln Asn 35 40 45 Gly Val Gln Asp Pro Gln His Glu Arg Ile Ile Thr Val Ser Thr Asn 50 55 60 Gly Ser Ile His Ser Pro Arg Phe Pro His Thr Tyr Pro Arg Asn Thr 65 70 75 80 Val Leu Val Trp Arg Leu Val Ala Val Glu Glu Asn Val Trp Ile Gln 85 90 95 Leu Thr Phe Asp Glu Arg Phe Gly Leu Glu Asp Pro Glu Asp Asp Ile 100 105 110 Cys Lys Tyr Asp Phe Val Glu Val Glu Glu Pro Ser Asp Gly Thr Ile 115 120 125 Leu Gly Arg Trp Cys Gly Ser Gly Thr Val Pro Gly Lys Gln Ile Ser 130 135 140 Lys Gly Asn Gln Ile Arg Ile Arg Phe Val Ser Asp Glu Tyr Phe Pro 145 150 155 160 Ser Glu Pro Gly Phe Cys Ile His Tyr Asn Ile Val Met Pro Gln Phe 165 170 175 Thr Glu Ala Val Ser Pro Ser Val Leu Pro Pro Ser Ala Leu Pro Leu 180 185 190 Asp Leu Leu Asn Asn Ala Ile Thr Ala Phe Ser Thr Leu Glu Asp Leu 195 200 205 Ile Arg Tyr Leu Glu Pro Glu Arg Trp Gln Leu Asp Leu Glu Asp Leu 210 215 220 Tyr Arg Pro Thr Trp Gln Leu Leu Gly Lys Ala Phe Val Phe Gly Arg 225 230 235 240 Lys Ser Arg Val Val Asp Leu Asn Leu Leu Thr Glu Glu Val Arg Leu 245 250 255 Tyr Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg Glu Glu Leu 260 265 270 Lys Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu Val Lys Arg 275 280 285 Cys Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn Glu Cys Gln 290 295 300 Cys Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val Leu Gln Leu 305 310 315 320 Arg Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu Thr Asp Val 325 330 335 Ala Leu Glu His His Glu Glu Cys Asp Cys Val Cys Arg Gly Ser Thr 340 345 350 Gly Gly 121 1097 DNA Homo sapiens 121 cgcagactaa ttcgagctcg aacaacaaca acaataacaa taacaacaac ctcgggatcg 60 gagggaagga tttcagaatt cgaatccaac ctgagtagta aattccagtt ttccagcaac 120 aaggaacaga acggagtaca agatcctcag catgagagaa ttattactgt gtctactaat 180 ggaagtattc acagcccaag gtttcctcat acttatccaa gaaatacggt cttggtatgg 240 agattagtag cagtagagga aaatgtatgg atacaactta cgtttgatga aagatttggg 300 cttgaagacc cagaagatga catatgcaag tatgattttg tagaagttga ggaacccagt 360 gatggaacta tattagggcg ctggtgtggt tctggtactg taccaggaaa acagatttct 420 aaaggaaatc aaattaggat aagatttgta tctgatgaat attttccttc tgaaccaggg 480 ttctgcatcc actacaacat tgtcatgcca caattcacag aagctgtgag tccttcagtg 540 ctaccccctt cagctttgcc actggacctg cttaataatg ctataactgc ctttagtacc 600 ttggaagacc ttattcgata tcttgaacca gagagatggc agttggactt agaagatcta 660 tataggccaa cttggcaact tcttggcaag gcttttgttt ttggaagaaa atccagagtg 720 gtggatctga accttctaac agaggaggta agattataca gctgcacacc tcgtaacttc 780 tcagtgtcca taagggaaga actaaagaga accgatacca ttttctggcc aggttgtctc 840 ctggttaaac gctgtggtgg gaactgtgcc tgttgtctcc acaattgcaa tgaatgtcaa 900 tgtgtcccaa gcaaagttac taaaaaatac cacgaggtcc ttcagttgag accaaagacc 960 ggtgtcaggg gattgcacaa atcactcacc gacgtggccc tggagcacca tgaggagtgt 1020 gactgtgtgt gcagagggag cacaggagga catcatcacc atcaccattg atctagagtc 1080 gacctgcagg caagctt 1097 122 355 PRT Homo sapiens 122 Gln Thr Asn Ser Ser Ser Asn Asn Asn Asn Asn Asn Asn Asn Asn Asn 1 5 10 15 Leu Gly Ile Glu Gly Arg Ile Ser Glu Phe Glu Ser Asn Leu Ser Ser 20 25 30 Lys Phe Gln Phe Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro 35 40 45 Gln His Glu Arg Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser 50 55 60 Pro Arg Phe Pro His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg 65 70 75 80 Leu Val Ala Val Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu 85 90 95 Arg Phe Gly Leu Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe 100 105 110 Val Glu Val Glu Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys 115 120 125 Gly Ser Gly Thr Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile 130 135 140 Arg Ile Arg Phe Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe 145 150 155 160 Cys Ile His Tyr Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser 165 170 175 Pro Ser Val Leu Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn 180 185 190 Ala Ile Thr Ala Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu 195 200 205 Pro Glu Arg Trp Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp 210 215 220 Gln Leu Leu Gly Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val 225 230 235 240 Asp Leu Asn Leu Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro 245 250 255 Arg Asn Phe Ser Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr 260 265 270 Ile Phe Trp Pro Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys 275 280 285 Ala Cys Cys Leu His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys 290 295 300 Val Thr Lys Lys Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly 305 310 315 320 Val Arg Gly Leu His Lys Ser Leu Thr Asp Val Ala Leu Glu His His 325 330 335 Glu Glu Cys Asp Cys Val Cys Arg Gly Ser Thr Gly Gly His His His 340 345 350 His His His 355 123 500 DNA Homo sapiens 123 aaggagatat acatatgcgg ggttctcatc atcatcatca tcatggtatg gctagcatga 60 ctggtggaca gcaaatgggt cgggatctgt acgacgatga cgataaggat ccgggaagaa 120 aatccagagt ggtggatctg aaccttctaa cagaggaggt aagattatac agctgcacac 180 ctcgtaactt ctcagtgtcc ataagggaag aactaaagag aaccgatacc attttctggc 240 caggttgtct cctggttaaa cgctgtggtg ggaactgtgc ctgttgtctc cacaattgca 300 atgaatgtca atgtgtccca agcaaagtta ctaaaaaata ccacgaggtc cttcagttga 360 gaccaaagac cggtgtcagg ggattgcaca aatcactcac cgacgtggcc ctggagcacc 420 atgaggagtg tgactgtgtg tgcagaggga gcacaggagg ataatgaatt cgaagcttga 480 tccggctgct aacaaagccc 500 124 149 PRT Homo sapiens 124 Met Arg Gly Ser His His His His His His Gly Met Ala Ser Met Thr 1 5 10 15 Gly Gly Gln Gln Met Gly Arg Asp Leu Tyr Asp Asp Asp Asp Lys Asp 20 25 30 Pro Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu Leu Thr Glu Glu 35 40 45 Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser Val Ser Ile Arg 50 55 60 Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro Gly Cys Leu Leu 65 70 75 80 Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu His Asn Cys Asn 85 90 95 Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys Tyr His Glu Val 100 105 110 Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu His Lys Ser Leu 115 120 125 Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp Cys Val Cys Arg 130 135 140 Gly Ser Thr Gly Gly 145 125 550 DNA Homo sapiens 125 ggcgatggcc atggatatcg gaattaattc ggatccggag tctcaccatc accaccatca 60 tgaatccaac ctgagtagta aattccagtt ttccagcaac aaggaacaga acggagtaca 120 agatcctcag catgagagaa ttattactgt gtctactaat ggaagtattc acagcccaag 180 gtttcctcat acttatccaa gaaatacggt cttggtatgg agattagtag cagtagagga 240 aaatgtatgg atacaactta cgtttgatga aagatttggg cttgaagacc cagaagatga 300 catatgcaag tatgattttg tagaagttga ggaacccagt gatggaacta tattagggcg 360 ctggtgtggt tctggtactg taccaggaaa acagatttct aaaggaaatc aaattaggat 420 aagatttgta tctgatgaat attttccttc tgaaccaggg ttctgcatcc actacaacat 480 tgtcatgcca caattcacag aagctgtgta gtcgagctcc gtcgacaagc ttgcggccgc 540 actcgagcac 550 126 168 PRT Homo sapiens 126 Met Ala Met Asp Ile Gly Ile Asn Ser Asp Pro Glu Ser His His His 1 5 10 15 His His His Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe Ser Ser Asn 20 25 30 Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg Ile Ile Thr 35 40 45 Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro His Thr Tyr 50 55 60 Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val Glu Glu Asn 65 70 75 80 Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu Glu Asp Pro 85 90 95 Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu Glu Pro Ser 100 105 110 Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr Val Pro Gly 115 120 125 Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe Val Ser Asp 130 135 140 Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr Asn Ile Val 145 150 155 160 Met Pro Gln Phe Thr Glu Ala Val 165 127 542 DNA Homo sapiens 127 tttcttttat accatatagt ggtggatctg aaccagggtt ctgcatccac tacaacattg 60 tcatgccaca attcacagaa gctgtgagtc cttcagtgct acccccttca gctttgccac 120 tggacctgct taataatgct ataactgcct ttagtacctt ggaagacctt attcgatatc 180 ttgaaccaga gagatggcag ttggacttag aagatctata taggccaact tggcaacttc 240 ttggcaaggc ttttgttttt ggaagaaaat ccagagtggt ggatctgaac cttctaacag 300 aggaggtaag attatacagc tgcacacctc gtaacttctc agtgtccata agggaagaac 360 taaagagaac cgataccatt ttctggccag gttgtctcct ggttaaacgc tgtggtggga 420 actgtgcctg ttgtctccac aattgcaatg aatgtcaatg tgtcccaagc aaagttacta 480 aaaaatacca cgaggtaggt atacaatttt ctttttggtt tccttcgggt attttatgtc 540 tt 542 128 1710 DNA Homo sapiens 128 aaagccagtc atagacattc gttgattttt aaaagtggct tactcttatt ccctttcagg 60 tccttcagtt gagaccaaag accggtgtca ggggattgca caaatcactc accgacgtgg 120 ccctggagca ccatgaggag tgtgactgtg tgtgcagagg gagcacagga ggatagccgc 180 atcaccacca gcagctcttg cccagagctg tgcagtgcag tggctgattc tattagagaa 240 cgtatgcgtt atctccatcc ttaatctcag ttgtttgctt caaggacctt tcatcttcag 300 gatttacagt gcattctgaa agaggagaca tcaaacagaa ttaggagttg tgcaacagct 360 cttttgagag gaggcctaaa ggacaggaga aaaggtcttc aatcgtggaa agaaaattaa 420 atgttgtatt aaatagatca ccagctagtt tcagagttac catgtacgta ttccactagc 480 tgggttctgt atttcagttc tttcgatacg gcttagggta atgtcagtac aggaaaaaaa 540 ctgtgcaagt gagcacctga ttccgttgcc ttggcttaac tctaaagctc catgtcctgg 600 gcctaaaatc gtataaaatc tggatttttt tttttttttt tgcgcatatt cacatatgta 660 aaccagaaca ttctatgtac tacaaacctg gtttttaaaa aggaactatg ttgctatgaa 720 ttaaacttgt gtcatgctga taggacagac tggatttttc atatttctta ttaaaatttc 780 tgccatttag aagaagagaa ctacattcat ggtttggaag agataaacct gaaaagaaga 840 gtggccttat cttcacttta tcgataagtc agtttatttg tttcattgtg tacattttta 900 tattctcctt ttgacattat aactgttggc ttttctaatc ttgttaaata tatctatttt 960 taccaaaggt atttaatatt cttttttatg acaacttaga tcaactattt ttagcttggt 1020 aaatttttct aaacacaatt gttatagcca gaggaacaaa gatgatataa aatattgttg 1080 ctctgacaaa aatacatgta tttcattctc gtatggtgct agagttagat taatctgcat 1140 tttaaaaaac tgaattggaa tagaattggt aagttgcaaa gactttttga aaataattaa 1200 attatcatat cttccattcc tgttattgga gatgaaaata aaaagcaact tatgaaagta 1260 gacattcaga tccagccatt actaacctat tccttttttg gggaaatctg agcctagctc 1320 agaaaaacat aaagcacctt gaaaaagact tggcagcttc ctgataaagc gtgctgtgct 1380 gtgcagtagg aacacatcct atttattgtg atgttgtggt tttattatct taaactctgt 1440 tccatacact tgtataaata catggatatt tttatgtaca gaagtatgtc tcttaaccag 1500 ttcacttatt gtactctggc aatttaaaag aaaatcagta aaatattttg cttgtaaaat 1560 gcttaatatc gtgcctaggt tatgtggtga ctatttgaat caaaaatgta ttgaatcatc 1620 aaataaaaga atgtggctat tttggggaga aaattatgtg tgtgtgtgct caagatttat 1680 ttcttggact ctgagaaaat gaaagataaa 1710 129 2668 DNA Homo sapiens 129 gaattcgccc ttttgtttaa accttgggaa ctggttcagg tccaggtttt gctttgatcc 60 ttttcaaaaa ctggagacac agaagagggc tctaggaaaa agttttggat gggattatgt 120 ggaaactacc ctgcgattct ctgctgccag agcaggctcg gcgcttccac cccagtgcag 180 ccttcccctg gcggtggtga aagagactcg ggagtcgctg cttccaaagt gcccgccgtg 240 agtgagctct caccccagtc agccaaatga gcctcttcgg gcttctcctg ctgacatctg 300 ccctggccgg ccagagacag gggactcagg cggaatccaa cctgagtagt aaattccagt 360 tttccagcaa caaggaacag aacggagtac aagatcctca gcatgagaga attattactg 420 tgtctactaa tggaagtatt cacagcccaa ggtttcctca tacttatcca agaaatacgg 480 tcttggtatg gagattagta gcagtagagg aaaatgtatg gatacaactt acgtttgatg 540 aaagatttgg gcttgaagac ccagaagatg acatatgcaa gtatgatttt gtagaagttg 600 aggaacccag tgatggaact atattagggc gctggtgtgg ttctggtact gtaccaggaa 660 aacagatttc taaaggaaat caaattagga taagatttgt atctgatgaa tattttcctt 720 ctgaaccagg gttctgcatc cactacaaca ttgtcatgcc acaattcaca gaagctgtga 780 gtccttcagt gctaccccct tcagctttgc cactggacct gcttaataat gctataactg 840 cctttagtac cttggaagac cttattcgat atcttgaacc agagagatgg cagttggact 900 tagaagatct atataggcca acttggcaac ttcttggcaa ggcttttgtt tttggaagaa 960 aatccagagt ggtggatctg aaccttctaa cagaggaggt aagattatac agctgcacac 1020 ctcgtaactt ctcagtgtcc ataagggaag aactaaagag aaccgatacc attttctggc 1080 caggttgtct cctggttaaa cgctgtggtg ggaactgtgc ctgttgtctc cacaattgca 1140 atgaatgtca atgtgtccca agcaaagtta ctaaaaaata ccacgaggtc cttcagttga 1200 gaccaaagac cggtgtcagg ggattgcaca aatcactcac cgacgtggcc ctggagcacc 1260 atgaggagtg tgactgtgtg tgcagaggga gcacaggagg atagccgcat caccaccagc 1320 agctcttgcc cagagctgtg cagtgcagtg gctgattcta ttagagaacg tatgcgttat 1380 ctccatcctt aatctcagtt gtttgcttca aggacctttc atcttcagga tttacagtgc 1440 attctgaaag aggagacatc aaacagaatt aggagttgtg caacagctct tttgagagga 1500 ggcctaaagg acaggagaaa aggtcttcaa tcgtggaaag aaaattaaat gttgtattaa 1560 atagatcacc agctagtttc agagttacca tgtacgtatt ccactagctg ggttctgtat 1620 ttcagttctt tcgatacggc ttagggtaat gtcagtacag gaaaaaaact gtgcaagtga 1680 gcacctgatt ccgttgcctt gcttaactct aaagctccat gtcctgggcc taaaatcgta 1740 taaaatctgg attttttttt tttttttttg ctcatattca catatgtaaa ccagaacatt 1800 ctatgtacta caaacctggt ttttaaaaag gaactatgtt gctatgaatt aaacttgtgt 1860 catgctgata ggacagactg gatttttcat atttcttatt aaaatttctg ccatttagaa 1920 gaagagaact acattcatgg tttggaagag ataaacctga aaagaagagt ggccttatct 1980 tcactttatc gataagtcag tttatttgtt tcattgtgta catttttata ttctcctttt 2040 gacattataa ctgttggctt ttctaatctt gttaaatata tctattttta ccaaaggtat 2100 ttaatattct tttttatgac aacttagatc aactattttt agcttggtaa atttttctaa 2160 acacaattgt tatagccaga ggaacaaaga tgatataaaa tattgttgct ctgacaaaaa 2220 tacatgtatt tcattctcgt atggtgctag agttagatta atctgcattt taaaaaactg 2280 aattggaata gaattggtaa gttgcaaaga ctttttgaaa ataattaaat tatcatatct 2340 tccattcctg ttattggaga tgaaaataaa aagcaactta tgaaagtaga cattcagatc 2400 cagccattac taacctattc cttttttggg gaaatctgag cctagctcag aaaaacataa 2460 agcaccttga aaaagacttg gcagcttcct gataaagcgt gctgtgctgt gcagtaggaa 2520 cacatcctat ttattgtgat gttgtggttt tattatctta aactctgttc catacacttg 2580 tataaataca tggatatttt tatgtacaga agtatgtctc ttaaccagtt cacttattgt 2640 acctggaagg gcgaattctg cagatatc 2668 130 345 PRT Homo sapiens 130 Met Ser Leu Phe Gly Leu Leu Leu Leu Thr Ser Ala Leu Ala Gly Gln 1 5 10 15 Arg Gln Gly Thr Gln Ala Glu Ser Asn Leu Ser Ser Lys Phe Gln Phe 20 25 30 Ser Ser Asn Lys Glu Gln Asn Gly Val Gln Asp Pro Gln His Glu Arg 35 40 45 Ile Ile Thr Val Ser Thr Asn Gly Ser Ile His Ser Pro Arg Phe Pro 50 55 60 His Thr Tyr Pro Arg Asn Thr Val Leu Val Trp Arg Leu Val Ala Val 65 70 75 80 Glu Glu Asn Val Trp Ile Gln Leu Thr Phe Asp Glu Arg Phe Gly Leu 85 90 95 Glu Asp Pro Glu Asp Asp Ile Cys Lys Tyr Asp Phe Val Glu Val Glu 100 105 110 Glu Pro Ser Asp Gly Thr Ile Leu Gly Arg Trp Cys Gly Ser Gly Thr 115 120 125 Val Pro Gly Lys Gln Ile Ser Lys Gly Asn Gln Ile Arg Ile Arg Phe 130 135 140 Val Ser Asp Glu Tyr Phe Pro Ser Glu Pro Gly Phe Cys Ile His Tyr 145 150 155 160 Asn Ile Val Met Pro Gln Phe Thr Glu Ala Val Ser Pro Ser Val Leu 165 170 175 Pro Pro Ser Ala Leu Pro Leu Asp Leu Leu Asn Asn Ala Ile Thr Ala 180 185 190 Phe Ser Thr Leu Glu Asp Leu Ile Arg Tyr Leu Glu Pro Glu Arg Trp 195 200 205 Gln Leu Asp Leu Glu Asp Leu Tyr Arg Pro Thr Trp Gln Leu Leu Gly 210 215 220 Lys Ala Phe Val Phe Gly Arg Lys Ser Arg Val Val Asp Leu Asn Leu 225 230 235 240 Leu Thr Glu Glu Val Arg Leu Tyr Ser Cys Thr Pro Arg Asn Phe Ser 245 250 255 Val Ser Ile Arg Glu Glu Leu Lys Arg Thr Asp Thr Ile Phe Trp Pro 260 265 270 Gly Cys Leu Leu Val Lys Arg Cys Gly Gly Asn Cys Ala Cys Cys Leu 275 280 285 His Asn Cys Asn Glu Cys Gln Cys Val Pro Ser Lys Val Thr Lys Lys 290 295 300 Tyr His Glu Val Leu Gln Leu Arg Pro Lys Thr Gly Val Arg Gly Leu 305 310 315 320 His Lys Ser Leu Thr Asp Val Ala Leu Glu His His Glu Glu Cys Asp 325 330 335 Cys Val Cys Arg Gly Ser Thr Gly Gly 340 345 

What is claimed is:
 1. An isolated nucleic acid molecule encoding a polypeptide having a CUB domain, wherein said polypeptide has anti-proliferative activity and consists essentially of the amino acid sequence of SEQ ID NO:26.
 2. An isolated nucleic acid molecule encoding a polypeptide having a CUB domain, said polypeptide consisting essentially of the amino acid sequence of SEQ ID NO:27.
 3. An isolated nucleic acid molecule according to claim 2, consisting essentially of the nucleotide sequence of SEQ ID NO:28.
 4. An expression vector comprising an isolated nucleic acid molecule according to claim 1 or claim
 2. 5. An isolated host cell transformed or transfected with an expression vector according to claim
 4. 6. An isolated nucleic acid molecule encoding a polypeptide that has anti-proliferative activity and consists essentially of the amino acid sequence of SEQ ID NO:26 for use as a medicament.
 7. A pharmaceutical composition comprising the isolated nucleic acid molecule according to any of claim 1 to 3, together with a pharmaceutically acceptable carrier, diluent or excipient therefor. 